Db       181 LVFFATSF 188



RESULT 62 
F69260 

nitrate ABC transporter, permease protein (nrtB-1) homolog - Archaeoglobus
fulgidus

C; Species: Archaeoglobus fulgidus 

C;Date: 05-Dec-1997 #sequence_revision 05-Dec-1997 #text_change 21-Jul-2000 
C; Accession: F69260 

R;Klenk, H.P.; Clayton, R.A.; Tomb, J.F.; White, O.; Nelson, K.E.; Ketchum,
K.A.; Dodson, R.J.; Gwinn, M.; Hickey, E.K.; Peterson, J.D.; Richardson, D.L.;
Kerlavage, A.R.; Graham, D.E.; Kyrpides, N.C.; Fleischmann, R.D.; Quackenbush,
J.; Lee, N.H.; Sutton, G.G.; Gill, S.; Kirkness, E.F.; Dougherty, B.A.; McKenny,
K.; Adams, M.D.; Loftus, B.; Peterson, S.; Reich, C.I.; McNeil, L.K.; Badger,
J.H.; Glodek, A.; Zhou, L.; Overbeek, R.; Gocayne, J.D.; Weidman, J.F.;
McDonald, L.

Nature 390, 364-370, 1997 

A;Authors: Utterback, T.; Cotton, M.D.; Spriggs, T.; Artiach, P.; Kaine, B.P.;
Sykes, S.M.; Sadow, P.W.; D'Andrea, K.P.; Bowman, C.; Fujii, C.; Garland, S.A.;
Mason, T.M.; Olsen, G.J.; Fraser, C.M.; Smith, H.O.; Woese, C.R.; Venter, J.C.
A;Title: The complete genome sequence of the hyperthermophilic, sulfate-reducing
archaeon Archaeoglobus fulgidus.

A; Reference number: A69250; MUID: 98049343; PMID: 9389475 
A;Accession: F69260 

A; Status: preliminary; nucleic acid sequence not shown; translation not shown 
A; Molecule type: DNA 
A; Residues: 1-244 <KLE> 

A;Cross-references: GB:AE001100; GB:AE000782; NID:g2689423; PIDN:AAB91143.1;
PID:g2650562; TIGR:AF0086

C;Superfamily: Synechococcus nitrate transport protein nrtB

Query Match 70.7%; Score 29; DB 2; Length 244; 

Best Local Similarity 62.5%; Pred. No. 1.4e+02; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0; 



Qy         1 LVFFAEDF 8
             ::|||| |
Db       186 VLFFAESF 193



RESULT 63 
B86301 

hypothetical protein F19K19.10 - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 02-Mar-2001 #sequence_revision 02-Mar-2001 #text_change 31-Dec-2001 
C;Accession: B86301 

R;Theologis, A.; Ecker, J.R.; Palm, C.J.; Federspiel, N.A.; Kaul, S.; White, O.;
Alonso, J.; Altaf, H.; Araujo, R.; Bowman, C.L.; Brooks, S.Y.; Buehler, E.;
Chan, A.; Chao, Q.; Chen, H.; Cheuk, R.F.; Chin, C.W.; Chung, M.K.; Conn, L.;
Conway, A.B.; Conway, A.R.; Creasy, T.H.; Dewar, K.; Dunn, P.; Etgu, P.;
Feldblyum, T.V.; Feng, J.; Fong, B.; Fujii, C.Y.; Gill, J.E.; Goldsmith, A.D.;
Haas, B.; Hansen, N.F.; Hughes, B.; Huizar, L.
Nature 408, 816-820, 2000 

A;Authors: Hunter, J.L.; Jenkins, J.; Johnson-Hopson, C.; Khan, S.; Khaykin, E.;
Kim, C.J.; Koo, H.L.; Kremenetskaia, I.; Kurtz, D.B.; Kwan, A.; Lam, B.;
Langin-Hooper, S.; Lee, A.; Lee, J.M.; Lenz, C.A.; Li, J.H.; Li, Y.; Lin, X.; Liu,
S.X.; Liu, Z.A.; Luros, J.S.; Maiti, R.; Marziali, A.; Militscher, J.; Miranda,
M.; Nguyen, M.; Nierman, W.C.; Osborne, B.I.; Pai, G.; Peterson, J.; Pham, P.K.;
Rizzo, M.; Rooney, T.; Rowley, D.; Sakano, H.

A;Authors: Salzberg, S.L.; Schwartz, J.R.; Shinn, P.; Southwick, A.M.; Sun, H.;
Tallon, L.J.; Tambunga, G.; Toriumi, M.J.; Town, C.D.; Utterback, T.; van Aken,
S.; Vaysberg, M.; Vysotskaia, V.S.; Walker, M.; Wu, D.; Yu, G.; Fraser, C.M.;
Venter, J.C.; Davis, R.W.

A; Title: Sequence and analysis of chromosome 1 of the plant Arabidopsis. 

A;Reference number: A86141; MUID:21016719; PMID:11130712
A;Accession: B86301
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-247 <STO>
A;Cross-references: GB:AE005172; NID:g9989059; PIDN:AAG10822.1; GSPDB:GN00141
C;Genetics:
A;Map position: 1

Query Match 70.7%; Score 29; DB 2; Length 247;

Best Local Similarity 71.4%; Pred. No. 1.4e+02; 

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 

Qy         1 LVFFAED 7
             :|||:||
Db       169 VVFFSED 175



RESULT 64 
C75200 

hypothetical protein PAB2263 - Pyrococcus abyssi (strain Orsay) 
C; Species: Pyrococcus abyssi 

C;Date: 20-Aug-1999 #sequence_revision 20-Aug-1999 #text_change 20-Jun-2000 

C;Accession: C75200

R; anonymous, Genoscope 

submitted to the EMBL Data Library, July 1999 

A;Description: Pyrococcus abyssi genome sequence: insights into archaeal
chromosome structure and evolution.

A; Reference number: A75001 

A;Accession: C75200 

A; Status : preliminary 

A;Molecule type: DNA 

A; Residues: 1-251 <KAW> 

A;Cross-references: GB:AJ248283; GB:AL096836; NID:g5457433; PIDN:CAB49050.1;
PID:g5457559
A;Experimental source: strain Orsay
C;Genetics:
A;Gene: PAB2263
C;Superfamily: Methanococcus jannaschii conserved hypothetical protein MJ0225

Query Match 70.7%; Score 29; DB 2; Length 251; 

Best Local Similarity 62.5%; Pred. No. 1.4e+02; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy         1 LVFFAEDF 8
             :|| |||:
Db       111 VVFLAEDY 118



RESULT 65 
T06661 

hypothetical protein T6G15.140 - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 23-Apr-1999 #sequence_revision 23-Apr-1999 #text_change 22-Oct-1999 
C;Accession: T06661 

R;Bevan, M.; Murphy, G.; Ridley, P.; Hudson, S.; Bancroft, I.; Mewes, H.W.;
Mayer, K.F.X.; Schueller, C.

submitted to the Protein Sequence Database, April 1999 
A;Reference number: Z15791
A;Accession: T06661
A;Molecule type: DNA
A;Residues: 1-273 <BEV>
A;Cross-references: EMBL:AL049656; GSPDB:GN00062; ATSP:T6G15.140
A;Experimental source: cultivar Columbia; BAC clone T6G15
C;Genetics:
A;Gene: ATSP:T6G15.140
A;Map position: 4

A;Introns: 48/1; 95/3; 111/3; 144/1; 220/3; 237/3 

Query Match 70.7%; Score 29; DB 2; Length 273; 

Best Local Similarity 100.0%; Pred. No. 1.5e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy         1 LVFFAE 6
             ||||||
Db       216 LVFFAE 221



RESULT 66 
F86805 

cation transporter yogJ [imported] - Lactococcus lactis subsp. lactis (strain 
IL1403) 

C; Species: Lactococcus lactis subsp. lactis 

C;Date: 23-Mar-2001 #sequence_revision 23-Mar-2001 #text_change 03-Aug-2001 
C; Accession: F86805 

R;Bolotin, A.; Wincker, P.; Mauger, S.; Jaillon, O.; Malarme, K.; Weissenbach, 
J.; Ehrlich, S.D.; Sorokin, A. 
Genome Res. 11, 731-753, 2001 

A; Title: The complete genome sequence of the lactic acid bacterium Lactococcus 
lactis ssp. lactis IL1403. 

A;Reference number: A86625; MUID:21235186; PMID:11337471
A;Accession: F86805
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-314 <STO>
A;Cross-references: GB:AE005176; PID:g12724436; PIDN:AAK05544.1; GSPDB:GN00146
A;Experimental source: strain IL1403
C;Genetics:
A;Gene: yogJ

Query Match 70.7%; Score 29; DB 2; Length 314; 

Best Local Similarity 85.7%; Pred. No. 1.8e+02; 

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy         1 LVFFAED 7
             || ||||
Db       163 LVIFAED 169



RESULT 67 
H71729 

hypothetical protein RP189 - Rickettsia prowazekii 
C; Species: Rickettsia prowazekii 

C;Date: 21-Nov-1998 #sequence_revision 21-Nov-1998 #text_change 03-Nov-2000 
C;Accession: H71729 

R;Andersson, S.G.E.; Zomorodipour, A.; Andersson, J.O.; Sicheritz-Ponten, T.;
Alsmark, U.C.M.; Podowski, R.M.; Naeslund, A.K.; Eriksson, A.S.; Winkler, H.H.;
Kurland, C.G.
Nature 396, 133-140, 1998

A;Title: The genome sequence of Rickettsia prowazekii and the origin of
mitochondria.

A; Reference number: A71630; MUID: 99039499; PMID: 9823893 
A;Accession: H71729 

A; Status: preliminary; nucleic acid sequence not shown; translation not shown 
A; Molecule type: DNA 
A; Residues: 1-321 <AND> 

A;Cross-references: GB:AJ235270; GB:AJ235269; NID:g3860572; PIDN:CAA14655.1;
PID:e1342498; PID:g3860754; GSPDB:GN00081
A;Experimental source: strain Madrid E
C;Genetics:
A;Gene: RP189

C;Superfamily: Rickettsia prowazekii hypothetical protein RP189

Query Match 70.7%; Score 29; DB 2; Length 321; 

Best Local Similarity 71.4%; Pred. No. 1.8e+02; 

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy         1 LVFFAED 7
             |:||| |
Db       179 LIFFAHD 185



RESULT 68 
T25676 

hypothetical protein F08D12.8 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C;Accession: T25676 

R;Le, T.; Waterston, R.

submitted to the EMBL Data Library, December 1996 

A; Description : The sequence of C. elegans cosmid F08D12. 

A;Reference number: Z20068 

A;Accession: T25676 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-332 <LET> 

A;Cross-references: EMBL:U80840; PIDN:AAB37932.1; GSPDB:GN00020; CESP:F08D12.8
A;Experimental source: strain Bristol N2; clone F08D12
C;Genetics:
A;Gene: CESP:F08D12.8

A;Map position: 2 

A;Introns: 27/2 



Query Match 70.7%; Score 29; DB 2; Length 332; 

Best Local Similarity 62.5%; Pred. No. 1.9e+02; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy         1 LVFFAEDF 8
             |||| :|:
Db        73 LVFFKKDY 80



RESULT 69 
S35229 

hypD 1 protein - Bradyrhizobium japonicum (fragment) 
C; Species: Bradyrhizobium japonicum 

C;Date: 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change 14-Apr-2003
C;Accession: S35229; S28641

R;van Soom, C.; Verreth, C.; Sampaio, M.J.; Vanderleyden, J.
Mol. Gen. Genet. 239, 235-240, 1993 

A;Title: Identification of a potential transcriptional regulator of hydrogenase
activity in free-living Bradyrhizobium japonicum strains.

A; Reference number: S35229; MUID: 93287991; PMID: 8510650 

A;Accession: S35229 

A;Molecule type: DNA
A;Residues: 1-347 <VAN>
A;Cross-references: EMBL:Z17373; NID:g39534; PIDN:CAA78988.1; PID:g39535
C;Genetics:
A;Gene: hypD 1

C;Superfamily: hydrogenase maturation factor, HypD type

Query Match 70.7%; Score 29; DB 2; Length 347;

Best Local Similarity 83.3%; Pred. No. 2e+02; 

Matches 5; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy         3 FFAEDF 8
             ||||:|
Db       182 FFAEEF 187



RESULT 70
S59678 

HST2 protein - yeast (Saccharomyces cerevisiae) 
N;Alternate names: protein LPA2c; protein YPL015c 
C; Species: Saccharomyces cerevisiae 

C;Date: 13-Jan-1996 #sequence_revision 01-Mar-1996 #text_change 29-Oct-1999
C;Accession: S59678

R;Hall, J.; Ahmed, A.; Bussey, H.; Fortin, N.; Friesen, J.D.; Storms, R.K.; Vo,
D.H.; Wang, Y.; Winnett, E.

submitted to the EMBL Data Library, August 1995 

A; Description: The sequence of Saccharomyces cerevisiae chromosome XVI left arm. 
A;Reference number: S59677 
A;Accession: S59678 
A; Molecule type: DNA 
A; Residues: 1-357 <HAL> 

A;Cross-references: EMBL:U33335; NID:g965076; PIDN:AAB68090.1; PID:g965078;
MIPS:YPL015c
C;Genetics:
A;Gene: SGD:HST2
A;Cross-references: SGD:S0005936; MIPS:YPL015c
A;Map position: 16L
C;Keywords: transmembrane protein
F;218-234/Domain: transmembrane #status predicted <TMM>



Query Match 70.7%; Score 29; DB 2; Length 357; 

Best Local Similarity 71.4%; Pred. No. 2e+02; 

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy         1 LVFFAED 7
             :||| ||
Db       181 IVFFGED 187



RESULT 71 
S32877 

hypD protein - Rhizobium leguminosarum bv. viciae 
C; Species: Rhizobium leguminosarum bv. viciae 

C;Date: 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change 14-Apr-2003
C;Accession: S32877

R;Rey, L.; Murillo, J.; Hernando, Y.; Hidalgo, E.; Cabrera, E.; Imperial, J.;
Ruiz-Argueeso, T.

Mol. Microbiol. 8, 471-481, 1993 

A; Title: Molecular analysis of a microaerobically induced operon required for 
hydrogenase synthesis in Rhizobium leguminosarum biovar viciae. 
A;Reference number: S32873; MUID:93316844; PMID:8326860
A;Accession: S32877
A;Status: nucleic acid sequence not shown
A;Molecule type: DNA
A;Residues: 1-385 <REY>
A;Cross-references: EMBL:X52974; NID:g1167855; PIDN:CAA37163.1; PID:g48736
C;Genetics:
A;Gene: hypD

C;Superfamily: hydrogenase maturation factor, HypD type

Query Match 70.7%; Score 29; DB 2; Length 385; 

Best Local Similarity 83.3%; Pred. No. 2.2e+02; 

Matches 5; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy         3 FFAEDF 8
             ||||:|
Db       215 FFAEEF 220



RESULT 72 
T25049 

hypothetical protein T21B4.1 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999
C;Accession: T25049 
R;Smye, R. 

submitted to the EMBL Data Library, October 1996 
A; Reference number: Z19974 
A; Accession: T25049 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-394 <WIL> 

A;Cross-references: EMBL:Z81124; PIDN:CAB03370.1; GSPDB:GN00020; CESP:T21B4.1
A;Experimental source: clone T21B4
C;Genetics:
A;Gene: CESP:T21B4.1

A; Map position: 2 

A;Introns: 33/1; 57/1; 105/1; 183/3; 312/3 



Query Match 70.7%; Score 29; DB 2; Length 394; 

Best Local Similarity 62.5%; Pred. No. 2.3e+02; 

Matches 5; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 



Qy         1 LVFFAEDF 8
             |:||  ||
Db       226 LIFFLPDF 233



RESULT 73
E90553 

hypothetical protein MYPU_3330 [imported] - Mycoplasma pulmonis (strain UAB 
CTIP) 

C; Species: Mycoplasma pulmonis 

C;Date: 24-May-2001 #sequence_revision 24-May-2001 #text_change 03-Aug-2001
C;Accession: E90553

R;Chambaud, I.; Heilig, R.; Ferris, S.; Barbe, V.; Samson, D.; Galisson, F.;
Moszer, I.; Dybvig, K.; Wroblewski, H.; Viari, A.; Rocha, E.P.C.; Blanchard, A.
Nucleic Acids Res. 29, 2145-2153, 2001 

A; Title: The complete genome sequence of the murine respiratory pathogen 
Mycoplasma pulmonis. 

A;Reference number: A99512; MUID:21267165; PMID:11353084
A;Accession: E90553
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-420 <KUR>
A;Cross-references: GB:AL445566; PID:g14089747; PIDN:CAC13506.1; GSPDB:GN00153
A;Experimental source: strain UAB CTIP
C;Genetics:
A;Gene: MYPU_3330
A;Genetic code: SGC3
C;Superfamily: histidine-tRNA ligase; histidine-tRNA ligase homology

Query Match 70.7%; Score 29; DB 2; Length 420; 

Best Local Similarity 50.0%; Pred. No. 2.4e+02; 

Matches 4; Conservative 4; Mismatches 0; Indels 0; Gaps 0; 



Qy         1 LVFFAEDF 8
             ::|||::|
Db       141 IIFFAKNF 148



RESULT 74 
E71082 

probable phospho-sugar mutase 2 - Pyrococcus horikoshii 
C; Species: Pyrococcus horikoshii 

C;Date: 14-Aug-1998 #sequence_revision 14-Aug-1998 #text_change 20-Jun-2000 
C;Accession: E71082 

R;Kawarabayasi, Y.; Sawada, M.; Horikawa, H.; Haikawa, Y.; Hino, Y.; Yamamoto,
S.; Sekine, M.; Baba, S.; Kosugi, H.; Hosoyama, A.; Nagai, Y.; Sakai, M.; Ogura,
K.; Otsuka, R.; Nakazawa, H.; Takamiya, M.; Ohfuku, Y.; Funahashi, T.; Tanaka,
T.; Kudoh, Y.; Yamazaki, J.; Kushida, N.; Oguchi, A.; Aoki, K.; Yoshizawa, T.;
Nakamura, Y.; Robb, F.T.; Horikoshi, K.; Masuchi, Y.; Shizuya, H.; Kikuchi, H.
DNA Res. 5, 55-76, 1998

A;Title: Complete sequence and gene organization of the genome of a hyper-
thermophilic archaebacterium, Pyrococcus horikoshii OT3.
A;Reference number: A71000; MUID:98344137; PMID:9679194
A;Accession: E71082 

A; Status: preliminary; nucleic acid sequence not shown; translation not shown 
A; Molecule type: DNA 
A; Residues: 1-455 <KAW> 

A;Cross-references: GB:AP000004; NID:g3236131; PIDN:BAA30019.1; PID:g3257336
A;Experimental source: strain OT3
A;Note: this accession replaces an interim accession for a sequence replaced by
GenBank
C;Genetics:
A;Gene: PH0923
C;Superfamily: phosphomannomutase

Query Match 70.7%; Score 29; DB 2; Length 455; 

Best Local Similarity 71.4%; Pred. No. 2.6e+02; 

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy         2 VFFAEDF 8
             :|| |||
Db       131 LFFKEDF 137



RESULT 75 
B84230 

hypothetical protein Vng0727c [imported] - Halobacterium sp. NRC-1 
C; Species: Halobacterium sp. NRC-1 

C;Date: 02-Feb-2001 #sequence_revision 02-Feb-2001 #text_change 02-Feb-2001 
C;Accession: B84230 

R;Ng, W.V.; Kennedy, S.P.; Mahairas, G.G.; Berquist, B.; Pan, M.; Shukla, H.D.;
Lasky, S.R.; Baliga, N.; Thorsson, V.; Sbrogna, J.; Swartzell, S.; Weir, D.;
Hall, J.; Dahl, T.A.; Welti, R.; Goo, Y.A.; Leithauser, B.; Keller, K.; Cruz,
R.; Danson, M.J.; Hough, D.W.; Maddocks, D.G.; Jablonski, P.E.; Krebs, M.P.;
Angevine, C.M.; Dale, H.; Isenbarger, T.A.; Peck, R.F.; Pohlschrod, M.; Spudich,
J.L.; Jung, K.H.; Alam, M.; Freitas, T.
Proc. Natl. Acad. Sci. U.S.A. 97, 12176-12181, 2000 

A;Authors: Hou, S.; Daniels, C.J.; Dennis, P.P.; Omer, A.D.; Ebhardt, H.; Lowe,
T.M.; Liang, P.; Riley, M.; Hood, L.; DasSarma, S.
A;Title: Genome sequence of Halobacterium species NRC-1.
A;Reference number: A84160; MUID:20504483; PMID:11016950
A;Accession: B84230

A; Status: preliminary 

A;Molecule type: DNA 

A; Residues: 1-494 <STO> 

A;Cross-references: GB:AE004437; NID:g10580306; PIDN:AAG19206.1; GSPDB:GN00138

C; Genetics: 

A; Gene: VNG0727C 

Query Match 70.7%; Score 29; DB 2; Length 494; 

Best Local Similarity 83.3%; Pred. No. 2.9e+02; 

Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy         3 FFAEDF 8
             || |||
Db       125 FFVEDF 130



RESULT 76 
T36945 

hypothetical protein SCJ1.12 - Streptomyces coelicolor
C; Species: Streptomyces coelicolor 

C;Date: 03-Dec-1999 #sequence_revision 03-Dec-1999 #text_change 03-Dec-1999 
C; Accession: T36945 

R;Seeger, K.J.; Harris, D.; Thomson, N.R.; Parkhill, J.; Barrell, B.G.;
Rajandream, M.A.

submitted to the EMBL Data Library, September 1999 
A; Reference number: Z21607 
A;Accession: T36945 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-508 <SEE> 

A;Cross-references: EMBL:AL109962; PIDN:CAB53130.1; GSPDB:GN00070;
SCOEDB:SCJ1.12
A;Experimental source: strain A3(2)
C;Genetics:
A;Gene: SCOEDB:SCJ1.12

Query Match 70.7%; Score 29; DB 2; Length 508; 

Best Local Similarity 71.4%; Pred. No. 2.9e+02; 

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



Qy         1 LVFFAED 7
             :||| ||
Db        25 MVFFVED 31



RESULT 77 
T11391 

cytochrome-c oxidase (EC 1.9.3.1) chain I - fat dormouse mitochondrion 
C; Species: mitochondrion Myoxus glis (fat dormouse) 

C;Date: 16-Jul-1999 #sequence_revision 16-Jul-1999 #text_change 24-Oct-2000 
C;Accession: T11391 

R;Reyes, A.; Pesole, G.; Saccone, C.
Mol. Biol. Evol. 15, 499-505, 1998 

A; Title: Complete mitochondrial DNA sequence of the fat dormouse, Glis glis: 
further evidence of rodent paraphyly. 

A;Reference number: Z17267; MUID: 98242079; PMID:9580978 
A;Accession: T11391 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-513 <REY> 

A;Cross-references: EMBL:AJ001562; NID:g3127895; PIDN:CAA04832.1; PID:g3127898;
GSPDB:GN00132
C;Genetics:
A;Gene: COI

A; Genome: mitochondrion 
A; Genetic code: SGC1 

C;Superfamily: cytochrome-c oxidase chain I; cytochrome-c oxidase chain I
homology
C;Keywords: copper; electron transfer; magnesium; membrane-associated complex;
metalloprotein; mitochondrion; oxidoreductase; respiratory chain
F;11-457/Domain: cytochrome-c oxidase chain I homology <COl>
F;240,290,291/Binding site: copper (His) #status predicted
F;240-244/Cross-link: 1'-histidyl-3'-tyrosine (His-Tyr) #status predicted
F;244/Binding site: oxygen (Tyr) #status predicted
F;368/Binding site: magnesium (His) (shared with chain II) #status predicted

Query Match 70.7%; Score 29; DB 2; Length 513; 

Best Local Similarity 62.5%; Pred. No. 3e+02; 

Matches 5; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 

Qy         1 LVFFAEDF 8
             | || :||
Db       423 LTFFPQDF 430



RESULT 78
G86587 

heat shock protein-60 [imported] - Chlamydophila pneumoniae (strain J138) 
C; Species: Chlamydophila pneumoniae, Chlamydia pneumoniae 

C;Date: 02-Mar-2001 #sequence_revision 02-Mar-2001 #text_change 23-Mar-2001 
C;Accession: G86587 

R;Shirai, M.; Hirakawa, H.; Kimoto, M.; Tabuchi, M.; Kishi, F.; Ouchi, K.;
Shiba, T.; Ishii, K.; Hattori, M.; Kuhara, S.; Nakazawa, T.
Nucleic Acids Res. 28, 2311-2314, 2000 

A; Title: Comparison of whole genome sequences of chlamydia pneumoniae J138. 

A; Reference number: A86491; MUID: 20330349; PMID: 10871362 

A;Accession: G86587 

A; Status: preliminary 

A;Molecule type: DNA 

A; Residues: 1-526 <STO> 

A;Cross-references: GB:BA000008; NID:g8979150; PIDN:BAA98985.1; GSPDB:GN00142
A;Experimental source: strain J138
C;Genetics:
A;Gene: groEL_2
C;Superfamily: chaperonin groEL

Query Match 70.7%; Score 29; DB 2; Length 526; 

Best Local Similarity 75.0%; Pred. No. 3e+02; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy         1 LVFFAEDF 8
             ||  ||||
Db       245 LVIIAEDF 252



RESULT 79
D72036 

heat shock protein-60 - Chlamydophila pneumoniae (strain CWL029) 
C; Species: Chlamydophila pneumoniae, Chlamydia pneumoniae 

C;Date: 23-Apr-1999 #sequence_revision 23-Apr-1999 #text_change 05-May-2000 
C;Accession: D72036 

R;Kalman, S.; Mitchell, W.; Marathe, R.; Lammel, C.; Fan, J.; Olinger, L.;
Grimwood, J.; Davis, R.W.; Stephens, R.S.
Nature Genet. 21, 385-389, 1999

A;Title: Comparative genomes of Chlamydia pneumoniae and C. trachomatis.



A;Reference number: A72000; MUID:99206606; PMID:10192388
A;Accession: D72036
A; Status: preliminary 
A; Molecule type: DNA 
A; Residues: 1-526 <ARN> 

A;Cross-references: GB:AE001659; GB:AE001363; NID:g4377081; PIDN:AAD18915.1;
PID:g4377086
A;Experimental source: strain CWL029
C;Genetics:
A;Gene: groEL_2
C;Superfamily: chaperonin groEL

Query Match 70.7%; Score 29; DB 2; Length 526;

Best Local Similarity 75.0%; Pred. No. 3e+02;

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy         1 LVFFAEDF 8
             ||  ||||
Db       245 LVIIAEDF 252



RESULT 80
F81504 

60 kDa chaperonin CP1095 [imported] - Chlamydophila pneumoniae (strain AR39) 
C;Species: Chlamydophila pneumoniae, Chlamydia pneumoniae

C;Date: 31-Mar-2000 #sequence_revision 31-Mar-2000 #text_change 19-May-2000
C;Accession: F81504

R;Read, T.D.; Brunham, R.C.; Shen, C.; Gill, S.R.; Heidelberg, J.F.; White, O.;
Hickey, E.K.; Peterson, J.; Utterback, T.; Berry, K.; Bass, S.; Linher, K.;
Weidman, J.; Khouri, H.; Craven, B.; Bowman, C.; Dodson, R.; Gwinn, M.; Nelson,
W.; DeBoy, R.; Kolonay, J.; McClarty, G.; Salzberg, S.L.; Eisen, J.; Fraser,
C.M.

Nucleic Acids Res. 28, 1397-1406, 2000 

A; Title: Genome sequences of Chlamydia trachomatis MoPn and Chlamydia pneumoniae 
AR39. 

A;Reference number: A81500; MUID:20150255; PMID:10684935
A;Accession: F81504
A;Status: preliminary
A; Molecule type: DNA 
A; Residues: 1-526 <REA> 

A;Cross-references: GB:AE002265; GB:AE002161; NID:g7189999; PIDN:AAF38865.1;
PID:g7190008; GSPDB:GN00122; TIGR:CP1095
A;Experimental source: strain AR39, HL cells
C;Genetics:
A;Gene: CP1095
C;Superfamily: chaperonin groEL

Query Match 70.7%; Score 29; DB 2; Length 526; 

Best Local Similarity 75.0%; Pred. No. 3e+02; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy         1 LVFFAEDF 8
             ||  ||||
Db       245 LVIIAEDF 252



RESULT 81
E82269

conserved hypothetical protein VC0880 [imported] - Vibrio cholerae (strain
N16961 serogroup O1)

C; Species: Vibrio cholerae 

C;Date: 18-Aug-2000 #sequence_revision 20-Aug-2000 #text_change 02-Feb-2001 
C;Accession: E82269 

R;Heidelberg, J.F.; Eisen, J.A.; Nelson, W.C.; Clayton, R.A.; Gwinn, M.L.;
Dodson, R.J.; Haft, D.H.; Hickey, E.K.; Peterson, J.D.; Umayam, L.A.; Gill,
S.R.; Nelson, K.E.; Read, T.D.; Tettelin, H.; Richardson, D.; Ermolaeva, M.D.;
Vamathevan, J.; Bass, S.; Qin, H.; Dragoi, I.; Sellers, P.; McDonald, L.;
Utterback, T.; Fleishmann, R.D.; Nierman, W.C.; White, O.; Salzberg, S.L.;
Smith, H.O.; Colwell, R.R.; Mekalanos, J.J.; Venter, J.C.; Fraser, C.M.
Nature 406, 477-483, 2000 

A;Title: DNA Sequence of both chromosomes of the cholera pathogen Vibrio
cholerae.

A;Reference number: A82035; MUID:20406833; PMID:10952301
A;Accession: E82269
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-534 <HEI>
A;Cross-references: GB:AE004172; GB:AE003852; NID:g9655323; PIDN:AAF94042.1;
GSPDB:GN00126; TIGR:VC0880
A;Experimental source: serogroup O1; strain N16961; biotype El Tor
C; Genetics: 
A;Gene: VC0880 
A;Map position: 1 

Query Match 70.7%; Score 29; DB 2; Length 534; 

Best Local Similarity 75.0%; Pred. No. 3.1e+02; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy         1 LVFFAEDF 8
             |||||  |
Db       373 LVFFAAQF 380



RESULT 82 
T32523 

hypothetical protein C49A9.2 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 29-Oct-1999 #sequence_revision 29-Oct-1999 #text_change 29-Oct-1999 

C;Accession: T32523 

R; Fulton, B.; Wohldmann, P. 

submitted to the EMBL Data Library, December 1997 



A;Description: The sequence of C. elegans cosmid C49A9.
A;Reference number: Z21184
A;Accession: T32523
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-572 <FUL>
A;Cross-references: EMBL:AF036693; PIDN:AAB88335.1; GSPDB:GN00022; CESP:C49A9.
A;Experimental source: strain Bristol N2; clone C49A9
C;Genetics:
A;Gene: CESP:C49A9.2
A;Map position: 4
A;Introns: 28/2; 83/3; 127/2; 186/3; 231/3; 281/3; 321/3; 363/2; 449/2; 530/3



Query Match 70.7%; Score 29; DB 2; Length 572; 

Best Local Similarity 75.0%; Pred. No. 3.3e+02;

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy         1 LVFFAEDF 8
             |||  |||
Db        99 LVFSTEDF 106



RESULT 83 
T24172 

hypothetical protein R11A8.4 - Caenorhabditis elegans
C;Species: Caenorhabditis elegans

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999
C;Accession: T24172
R;Cummings, P.

submitted to the EMBL Data Library, March 1996 
A; Reference number: Z19849 
A; Accession: T24172 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-607 <WIL> 

A;Cross-references: EMBL:Z70310; PIDN:CAA94364.1; GSPDB:GN00022; CESP:R11A8.4

A; Experimental source: clone R11A8 

C; Genetics : 

A; Gene: CESP:R11A8.4 

A;Map position: 4 

A;Introns: 18/3; 68/3; 111/3; 206/3; 500/3; 565/3 

Query Match 70.7%; Score 29; DB 2; Length 607; 

Best Local Similarity 71.4%; Pred. No. 3.5e+02; 

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy         1 LVFFAED 7
             :||| ||
Db       298 IVFFGED 304



RESULT 84 
S56284 

hypothetical protein YFR029w - yeast (Saccharomyces cerevisiae)
N;Alternate names: hypothetical protein F001
C;Species: Saccharomyces cerevisiae

C;Date: 02-Sep-1995 #sequence_revision 19-Oct-1995 #text_change 20-Jun-2000
C;Accession: S56284; S62240; S63834

R;Murakami, Y.; Naitou, M.; Hagiwara, H.; Shibata, T.; Ozawa, M.; Sasanuma,
S.I.; Sasanuma, M.; Tsuchiya, Y.; Soeda, E.; Yokoyama, K.; Yamazaki, M.;
Tashiro, H.; Eki, T.

submitted to the EMBL Data Library, May 1995 

A;Description: Analysis of the nucleotide sequence of chromosome VI from
Saccharomyces cerevisiae.
A;Reference number: S56186
A;Accession: S56284
A;Molecule type: DNA
A;Residues: 1-678 <MUR>
A;Cross-references: EMBL:D50617; NID:g836685; PIDN:BAA09268.1; PID:g836784;
MIPS:YFR029w



R; Murakami, Y. 

submitted to the EMBL Data Library, December 1994 
A;Reference number: S62230 
A;Accession: S62240 
A;Molecule type: DNA 
A; Residues: 1-678 <MUW> 

A;Cross-references: EMBL:D44602; NID:g893419; PIDN:BAA08040.1; PID:g893424
R;Eki, T.; Naitou, M.; Hagiwara, H.; Abe, M.; Ozawa, M.; Sasanuma, S.I.;
Sasanuma, M.; Tsuchiya, Y.; Shibata, T.; Watanabe, K.; Ono, A.; Yamazaki, M.A.;
Tashiro, H.; Hanaoka, F.; Murakami, Y.
Yeast 12, 177-190, 1996 

A; Title: Fifteen open reading frames in a 30.8 kb region of the right arm of 
chromosome VI from Saccharomyces cerevisiae. 
A;Reference number: S63830; MUID:96287654; PMID:8686381
A;Accession: S63834
A;Status: nucleic acid sequence not shown; translation not shown
A;Molecule type: DNA
A;Residues: 1-678 <EKI>
A;Cross-references: EMBL:D44602; NID:g893419; PIDN:BAA08040.1; PID:g893424
A;Note: the nucleotide sequence was submitted to the EMBL Data Library, July
1995
C;Genetics:
A;Gene: SGD:PTR3
A;Cross-references: SGD:S0001925; MIPS:YFR029w
A;Map position: 6R

C;Superfamily: Saccharomyces cerevisiae hypothetical protein YFR029w

Query Match 70.7%; Score 29; DB 2; Length 678; 

Best Local Similarity 83.3%; Pred. No. 4e+02; 

Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy         3 FFAEDF 8
             || |||
Db       654 FFVEDF 659



RESULT 85 
AF2275 

cellulose synthase catalytic chain [imported] - Nostoc sp. (strain PCC 7120) 
C; Species: Nostoc sp. PCC 7120 

A;Note: Nostoc sp. strain PCC 7120 is a synonym of Anabaena sp. strain PCC 7120 
C;Date: 14-Dec-2001 #sequence_revision 14-Dec-2001 #text_change 09-Dec-2002 
C; Accession: AF2275 

R;Kaneko, T.; Nakamura, Y.; Wolk, C.P.; Kuritz, T.; Sasamoto, S.; Watanabe, A.;
Iriguchi, M.; Ishikawa, A.; Kawashima, K.; Kimura, T.; Kishida, Y.; Kohara, M.;
Matsumoto, M.; Matsuno, A.; Muraki, A.; Nakazaki, N.; Shimpo, S.; Sugimoto, M.;
Takazawa, M.; Yamada, M.; Yasuda, M.; Tabata, S.
DNA Res. 8, 205-213, 2001 

A;Title: Complete Genomic Sequence of the Filamentous Nitrogen-fixing
Cyanobacterium Anabaena sp. strain PCC 7120.
A;Reference number: AB1807; MUID:21595285; PMID:11759840
A;Accession: AF2275
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-693 <KUR>
A;Cross-references: GB:BA000019; PIDN:BAB75456.1; PID:g17132891; GSPDB:GN00179
A;Experimental source: strain PCC 7120



C; Genetics : 
A;Gene: alr3757 



Query Match 70.7%; Score 29; DB 2; Length 693; 

Best Local Similarity 62.5%; Pred. No. 4.1e+02; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0; 

Qy         1 LVFFAEDF 8
             | :||:||
Db       420 LQYFADDF 427



RESULT 86 
JH0595 

potassium channel protein cdrK - rat 

C; Species: Rattus norvegicus (Norway rat) 

C;Date: 17-Jul-1992 #sequence_revision 17-Jul-1992 #text_change 20-Aug-1999 
C;Accession: JH0595 

R;Hwang, P.M.; Glatt, C.E.; Bredt, D.S.; Yellen, G.; Snyder, S.H. 
Neuron 8, 473-481, 1992 

A; Title: A novel K+ channel with unique localizations in mammalian brain: 
molecular cloning and characterization. 

A; Reference number: JH0595; MUID: 92198655; PMID: 1550672 
A; Accession: JH0595 
A; Molecule type: mRNA 
A; Residues: 1-802 <HWA> 

A;Cross-references: GB:M77482; NID:g203395; PIDN:AAA40905.1; PID:g203396
A;Experimental source: circumvallate papillae
C;Superfamily: potassium channel protein drk1
C;Keywords: glycoprotein; phosphoprotein; transmembrane protein
F;191-212/Domain: transmembrane #status predicted <TM1>
F;233-254/Domain: transmembrane #status predicted <TM2>
F;265-286/Domain: transmembrane #status predicted <TM3>
F;299-320/Domain: transmembrane #status predicted <TM4>
F;335-356/Domain: transmembrane #status predicted <TM5>
F;397-418/Domain: transmembrane #status predicted <TM6>
F;187,287,419,446/Binding site: carbohydrate (Asn) (covalent) #status predicted
F;448,500/Binding site: phosphate (Ser) (covalent) #status predicted

Query Match 70.7%; Score 29; DB 2; Length 802; 

Best Local Similarity 100.0%; Pred. No. 4.7e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy         1 LVFFAE 6
             ||||||
Db       352 LVFFAE 357



RESULT 87 
F75294 

DNA modification methyltransferase-related protein - Deinococcus radiodurans
(strain R1)

C; Species: Deinococcus radiodurans 

C;Date: 03-Dec-1999 #sequence_revision 03-Dec-1999 #text_change 17-Mar-2000 
C; Accession: F75294 

R;White, O.; Eisen, J.A.; Heidelberg, J.F.; Hickey, E.K.; Peterson, J.D.;
Dodson, R.J.; Haft, D.H.; Gwinn, M.L.; Nelson, W.C.; Richardson, D.L.; Moffat,
K.S.; Qin, H.; Jiang, L.; Pamphile, W.; Crosby, M.; Shen, M.; Vamathevan, J.J.;
Lam, P.; McDonald, L.; Utterback, T.; Zalewski, C.; Makarova, K.S.; Aravind, L.;
Daly, M.J.; Minton, K.W.; Fleischmann, R.D.; Ketchum, K.A.; Nelson, K.E.;
Salzberg, S.; Smith, H.O.; Venter, J.C.; Fraser, C.M.
Science 286, 1571-1577, 1999 

A;Title: Genome sequence of the radioresistant bacterium Deinococcus radiodurans
R1.

A;Reference number: A75250; MUID:20036896; PMID:10567266
A;Accession: F75294
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-840 <WHI>

A;Cross-references: GB:AE002059; GB:AE000513; NID:g6460070; PIDN:AAF11813.1;
PID:g6460073; TIGR:DR2267; GSPDB:GN00077

A;Experimental source: strain R1

C;Genetics:

A;Gene: DR2267

A;Map position: 1 

Query Match 70.7%; Score 29; DB 2; Length 840; 

Best Local Similarity 62.5%; Pred. No. 4.9e+02; 

Matches 5; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 

Qy          1 LVFFAEDF 8
              :  |||||
Db        195 MAMFAEDF 202



RESULT 88
CHRTD1

potassium channel protein drk1 - rat

C; Species: Rattus norvegicus (Norway rat) 

C;Date: 30-Sep-1991 #sequence_revision 30-Sep-1991 #text_change 22-Jun-1999 
C;Accession: S05448; A44838 

R;Frech, G.C.; VanDongen, A.M.J.; Schuster, G.; Brown, A.M.; Joho, R.H.
Nature 340, 642-645, 1989 

A; Title: A novel potassium channel with delayed rectifier properties isolated 
from rat brain by expression cloning. 

A;Reference number: S05448; MUID:89365157; PMID:2770868
A;Accession: S05448
A;Molecule type: mRNA
A;Residues: 1-853 <FRE>

A;Cross-references: EMBL:X16476; NID:g57785; PIDN:CAA34497.1; PID:g57786
A;Note: it is uncertain whether Met-1 or Met-17 is the initiator
R;Drewe, J.A.; Verma, S.; Frech, G.; Joho, R.H.
J. Neurosci. 12, 538-548, 1992 

A; Title: Distinct spatial and temporal expression patterns of K+ channel mRNAs 
from different subfamilies. 

A;Reference number: A44838; MUID:92156897; PMID:1740690
A;Accession: A44838
A;Status: preliminary; not compared with conceptual translation
A;Molecule type: mRNA
A;Residues: 'MPAG',1-571 <DRE>
A;Cross-references: GB:M81783; NID:g205038
A;Experimental source: brain
A;Note: sequence extracted from NCBI backbone (NCBIP:81768)
C;Genetics:
A;Gene: drk1

C;Superfamily: potassium channel protein drk1

C; Keywords: glycoprotein; ion channel; phosphoprotein; potassium channel; 
transmembrane protein 

F;1-182/Domain: intracellular #status predicted <INT1>
F;183-204/Domain: transmembrane #status predicted <TM1>
F;225-245/Domain: transmembrane #status predicted <TM2>
F;256-276/Domain: transmembrane #status predicted <TM3>
F;291-312/Domain: transmembrane #status predicted <TM4>
F;327-348/Domain: transmembrane #status predicted <TM5>
F;389-410/Domain: transmembrane #status predicted <TM6>
F;411-853/Domain: intracellular #status predicted <INT2>
F;279/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 70.7%; Score 29; DB 1; Length 853; 

Best Local Similarity 100.0%; Pred. No. 5e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 LVFFAE 6
              ||||||
Db        344 LVFFAE 349



RESULT 89
I56529

potassium channel protein - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 26-Jul-1996 #sequence_revision 26-Jul-1996 #text_change 20-Aug-1999 
C;Accession: I56529

R;Pak, M.D.; Covarrubias, M.; Ratcliffe, A.; Salkoff, L.
J. Neurosci. 11, 869-880, 1991

A;Title: A mouse brain homolog of the Drosophila Shab K+ channel with conserved
delayed-rectifier properties.

A;Reference number: I56529; MUID:91162315; PMID:2002364
A;Accession: I56529

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-857 <RES> 

A;Cross-references: GB:M64228; NID:g200975; PIDN:AAA40112.1; PID:g200976
C;Genetics:
A;Gene: Shab

C;Superfamily: potassium channel protein drk1

Query Match 70.7%; Score 29; DB 2; Length 857; 

Best Local Similarity 100.0%; Pred. No. 5e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 LVFFAE 6
              ||||||
Db        348 LVFFAE 353



RESULT 90 
S31761 

potassium channel protein DRK1 - human 
C; Species: Homo sapiens (man) 

C;Date: 13-Jan-1995 #sequence_revision 13-Jan-1995 #text_change 20-Aug-1999
C;Accession: S31761 

R;Albrecht, B.; Lorra, C.; Stocker, K.; Pongs, O.
submitted to the EMBL Data Library, September 1992 

A;Description: Cloning, expression and chromosomal localization of the delayed
rectifier type K+ channel human 2.1 (h-DRK1) gene.

A; Reference number: S31761 

A;Accession: S31761 

A; Molecule type: DNA 

A; Residues: 1-858 <ALB> 

A;Cross-references: EMBL:X68302; NID:g30892; PIDN:CAA48374.1; PID:g30893
C;Genetics:
A;Gene: GDB:KCNB1; KV2.1; DRK1
A;Cross-references: GDB:128081; OMIM:600397
A;Map position: 20q13.2-20q13.2

C;Superfamily: potassium channel protein drk1

Query Match 70.7%; Score 29; DB 2; Length 858; 

Best Local Similarity 100.0%; Pred. No. 5.1e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy          1 LVFFAE 6
              ||||||
Db        348 LVFFAE 353



RESULT 91 
A69329 

probable isoleucine-tRNA ligase (EC 6.1.1.5) - Archaeoglobus fulgidus 
C; Species: Archaeoglobus fulgidus 

C;Date: 05-Dec-1997 #sequence_revision 05-Dec-1997 #text_change 03-Jun-2002 
C;Accession: A69329 

R;Klenk, H.P.; Clayton, R.A.; Tomb, J.F.; White, O.; Nelson, K.E.; Ketchum,
K.A.; Dodson, R.J.; Gwinn, M.; Hickey, E.K.; Peterson, J.D.; Richardson, D.L.;
Kerlavage, A.R.; Graham, D.E.; Kyrpides, N.C.; Fleischmann, R.D.; Quackenbush,
J.; Lee, N.H.; Sutton, G.G.; Gill, S.; Kirkness, E.F.; Dougherty, B.A.; McKenny,
K.; Adams, M.D.; Loftus, B.; Peterson, S.; Reich, C.I.; McNeil, L.K.; Badger,
J.H.; Glodek, A.; Zhou, L.; Overbeek, R.; Gocayne, J.D.; Weidman, J.F.;
McDonald, L.

Nature 390, 364-370, 1997

A;Authors: Utterback, T.; Cotton, M.D.; Spriggs, T.; Artiach, P.; Kaine, B.P.;
Sykes, S.M.; Sadow, P.W.; D'Andrea, K.P.; Bowman, C.; Fujii, C.; Garland, S.A.;
Mason, T.M.; Olsen, G.J.; Fraser, C.M.; Smith, H.O.; Woese, C.R.; Venter, J.C.
A;Title: The complete genome sequence of the hyperthermophilic, sulfate-reducing
archaeon Archaeoglobus fulgidus.

A; Reference number: A69250; MUID: 98049343; PMID: 9389475 
A;Accession: A69329 

A; Status: preliminary; nucleic acid sequence not shown; translation not shown 

A; Molecule type: DNA 

A; Residues: 1-1018 <KLE> 

A;Cross-references: GB:AE001061; GB:AE000782; NID:g2689384; PIDN:AAB90608.1;
PID:g2649988; TIGR:AF0633

C;Superfamily: isoleucine-tRNA ligase

C; Keywords: ligase 



Query Match 70.7%; Score 29; DB 2; Length 1018; 

Best Local Similarity 83.3%; Pred. No. 6e+02; 

Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          3 FFAEDF 8
              || |||
Db        700 FFVEDF 705



RESULT 92 
S72163 

methyl-accepting chemotaxis protein mcpA - Rhizobium leguminosarum bv. viciae
plasmid

N;Alternate names: chemoreceptor protein 

C; Species: Rhizobium leguminosarum bv. viciae 

C;Date: 04-Dec-1997 #sequence_revision 12-Dec-1997 #text_change 29-Sep-1999 
C;Accession: S72163 

R;Brito, B.; Palacios, J.M.; Ruiz-Argueeso, T.; Imperial, J.
Biochim. Biophys. Acta 1308, 7-11, 1996

A; Title: Identification of a gene for a chemoreceptor of the methyl-accepting 
type in the symbiotic plasmid of Rhizobium leguminosarum bv. viciae UPM791. 
A; Reference number: S72162; MUID: 96328256; PMID: 8765742 
A;Accession: S72163 
A; Molecule type: DNA 
A; Residues: 1-639 <BRI> 

A;Cross-references: EMBL:U23040; NID:g780654; PIDN:AAC44311.1; PID:g780656
A;Experimental source: strain UPM791
C;Genetics:
A;Gene: mcpA
A;Genome: plasmid
A;Start codon: GTG

C;Superfamily: methyl-accepting chemotaxis protein mcpA

C; Keywords: chemotaxis; sensory transduction; transmembrane protein 

Query Match 69.5%; Score 28.5; DB 2; Length 639; 

Best Local Similarity 61.5%; Pred. No. 4.8e+02; 

Matches 8; Conservative 0; Mismatches 0; Indels 5; Gaps 1; 

Qy          1 LVF-----FAEDF 8
              |||     |||||
Db        315 LVFSLDDKFAEDF 327



RESULT 93 
MNIHB2 

nonstructural protein NS2 - bovine coronavirus 
C; Species: bovine coronavirus 

C;Date: 31-Mar-1991 #sequence_revision 31-Mar-1991 #text_change 16-Jul-1999
C;Accession: S08408; C46346
R;Woloszyn, N.; Boireau, P.; Laporte, J.
Nucleic Acids Res. 18, 1303, 1990

A;Title: Nucleotide sequence of the bovine enteric coronavirus BECV F15 mRNA 5
and mRNA 6 unique regions.

A; Reference number: S08408; MUID: 90206809; PMID:2320429 
A; Accession: S08408 
A; Molecule type: mRNA 
A; Residues: 1-109 <WOL> 

A;Cross-references: EMBL:X51347; NID:g58684; PIDN:CAA35740.1; PID:g58685
A; Experimental source: strain F15 

A; Note: the source is designated as bovine enteric coronavirus 



R;Abraham, S.; Kienzle, T.E.; Lapps, W.E.; Brian, D.A. 
Virology 177, 488-495, 1990 

A;Title: Sequence and expression analysis of potential nonstructural proteins of
4.9, 4.8, 12.7, and 9.5 kDa encoded between the spike and membrane protein genes
of the bovine coronavirus.

A;Reference number: A46346; MUID:90320120; PMID:2142556
A;Accession: C46346
A;Molecule type: genomic RNA
A;Residues: 1-109 <ABR>

A;Cross-references: GB:M31054; NID:g323363; PIDN:AAA42913.1; PID:g323367
A; Experimental source: strain Mebus 

C;Superfamily: bovine coronavirus nonstructural protein NS2
C;Keywords: nonstructural protein 

Query Match 68.3%; Score 28; DB 1; Length 109; 

Best Local Similarity 75.0%; Pred. No. 98; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy          1 LVFFAEDF 8
              ||| | ||
Db         90 LVFLAVDF 97



RESULT 94 
A44275 

nonstructural protein NS2 - human coronavirus (strain OC43) 
C;Species: human coronavirus

C;Date: 17-Feb-1994 #sequence_revision 17-Feb-1994 #text_change 16-Jul-1999
C;Accession: A44275
R;Mounir, S.; Talbot, P.J.
Virology 192, 355-360, 1993

A;Title: Human coronavirus OC43 RNA 4 lacks two open reading frames located
downstream of the S gene of bovine coronavirus.
A;Reference number: A44275; MUID:93297129; PMID:8517026
A;Accession: A44275
A;Molecule type: mRNA
A;Residues: 1-109 <MOU>

A;Cross-references: GB:M99576; NID:g329567; PIDN:AAA02569.1; PID:g329568
C;Superfamily: bovine coronavirus nonstructural protein NS2
C;Keywords: nonstructural protein

Query Match 68.3%; Score 28; DB 1; Length 109; 

Best Local Similarity 75.0%; Pred. No. 98; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy          1 LVFFAEDF 8
              ||| | ||
Db         90 LVFLAVDF 97



RESULT 95 
S58182 

nonstructural protein, 12.8K - porcine hemagglutinating encephalomyelitis virus
(strain 67N and strain NT9) 

C; Species: porcine hemagglutinating encephalomyelitis virus 
A;Variety: strain 67N; strain NT9 

C;Date: 19-Mar-1997 #sequence_revision 19-Mar-1997 #text_change 20-Sep-1999
C;Accession: S58182; S58184

R;Vieler, E.; Schlapp, T.; Herbst, W.
submitted to the EMBL Data Library, July 1995
A;Reference number: S58182
A;Accession: S58182
A;Molecule type: DNA
A;Residues: 1-109 <VIE>

A;Cross-references: EMBL:X89861; NID:g927492; PIDN:CAA61957.1; PID:g927493
A;Experimental source: strain 67N
A;Accession: S58184
A;Molecule type: DNA
A;Residues: 1-109 <VIW>

A;Cross-references: EMBL:X89863; NID:g927495; PIDN:CAA61961.1; PID:g927496
A;Experimental source: strain NT9

C;Superfamily: bovine coronavirus nonstructural protein NS2
C;Keywords: nonstructural protein

Query Match 68.3%; Score 28; DB 2; Length 109; 

Best Local Similarity 75.0%; Pred. No. 98; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy          1 LVFFAEDF 8
              ||| | ||
Db         90 LVFLAVDF 97



RESULT 96 
S58186 

nonstructural protein, 12.8K - porcine hemagglutinating encephalomyelitis virus
(strain VW572) 

C; Species: porcine hemagglutinating encephalomyelitis virus 
A;Variety: strain VW572 

C;Date: 19-Mar-1997 #sequence_revision 19-Mar-1997 #text_change 20-Sep-1999
C;Accession: S58186

R;Vieler, E.; Schlapp, T.; Herbst, W.
submitted to the EMBL Data Library, July 1995
A;Reference number: S58182
A;Accession: S58186
A;Molecule type: DNA
A;Residues: 1-109 <VIE>

A;Cross-references: EMBL:X89862; NID:g927498; PIDN:CAA61959.1; PID:g927499
A;Experimental source: strain VW572

C;Superfamily: bovine coronavirus nonstructural protein NS2

Query Match 68.3%; Score 28; DB 2; Length 109; 

Best Local Similarity 75.0%; Pred. No. 98; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy          1 LVFFAEDF 8
              ||| | ||
Db         90 LVFLAVDF 97



RESULT 97 
R5HSS6 

ribosomal protein HS6 [validated] - Haloarcula marismortui 
C; Species: Haloarcula marismortui 



C;Date: 31-Mar-1991 #sequence_revision 31-Mar-1991 #text_change 21-Jul-2000
C;Accession: S00182

R;Kimura, J.; Arndt, E.; Kimura, M.
FEBS Lett. 224, 65-70, 1987

A;Title: Primary structures of three highly acidic ribosomal proteins S6, S12
and S15 from the archaebacterium Halobacterium marismortui.

A;Reference number: S00182; MUID:88055606; PMID:3315748

A;Accession: S00182 

A;Molecule type: protein 

A; Residues: 1-116 <KIM> 

A;Note: the source is designated as Halobacterium marismortui 

A;Note: the protein is designated as ribosomal protein S6 

A;Note: the protein was extracted from the 30S small ribosomal subunit 

C;Superfamily: rat ribosomal protein L7a

C; Keywords: protein biosynthesis; ribosome 

Query Match 68.3%; Score 28; DB 1; Length 116; 

Best Local Similarity 85.7%; Pred. No. 1e+02;

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy          1 LVFFAED 7
              ||| |||
Db         47 LVFVAED 53



RESULT 98 
H84375 

Holliday junction resolvase [imported] - Halobacterium sp. NRC-1 
C; Species: Halobacterium sp. NRC-1 

C;Date: 02-Feb-2001 #sequence_revision 02-Feb-2001 #text_change 12-Jun-2003 
C; Accession: H84375 

R;Ng, W.V.; Kennedy, S.P.; Mahairas, G.G.; Berquist, B.; Pan, M.; Shukla, H.D.;
Lasky, S.R.; Baliga, N.; Thorsson, V.; Sbrogna, J.; Swartzell, S.; Weir, D.;
Hall, J.; Dahl, T.A.; Welti, R.; Goo, Y.A.; Leithauser, B.; Keller, K.; Cruz,
R.; Danson, M.J.; Hough, D.W.; Maddocks, D.G.; Jablonski, P.E.; Krebs, M.P.;
Angevine, C.M.; Dale, H.; Isenbarger, T.A.; Peck, R.F.; Pohlschrod, M.; Spudich,
J.L.; Jung, K.H.; Alam, M.; Freitas, T.
Proc. Natl. Acad. Sci. U.S.A. 97, 12176-12181, 2000

A;Authors: Hou, S.; Daniels, C.J.; Dennis, P.P.; Omer, A.D.; Ebhardt, H.; Lowe,
T.M.; Liang, P.; Riley, M.; Hood, L.; DasSarma, S.
A;Title: Genome sequence of Halobacterium species NRC-1.
A;Reference number: A84160; MUID:20504483; PMID:11016950
A;Accession: H84375
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-141 <STO>

A;Cross-references: GB:AE004437; NID:g10581665; PIDN:AAG20372.1; GSPDB:GN00138
C;Genetics:
A;Gene: hjr

C;Superfamily: uncharacterized conserved protein MJ0497

Query Match 68.3%; Score 28; DB 2; Length 141; 

Best Local Similarity 62.5%; Pred. No. 1.3e+02; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0; 



Qy          1 LVFFAEDF 8
              ||:|| :|
Db         74 LVYFARNF 81



RESULT 99 
C83475 

hypothetical protein PA1354 [imported] - Pseudomonas aeruginosa (strain PAO1)
C; Species: Pseudomonas aeruginosa 

C;Date: 15-Sep-2000 #sequence_revision 15-Sep-2000 #text_change 31-Dec-2000 
C;Accession: C83475

R;Stover, C.K.; Pham, X.Q.; Erwin, A.L.; Mizoguchi, S.D.; Warrener, P.; Hickey,
M.J.; Brinkman, F.S.L.; Hufnagle, W.O.; Kowalik, D.J.; Lagrou, M.; Garber, R.L.;
Goltry, L.; Tolentino, E.; Westbrook-Wadman, S.; Yuan, Y.; Brody, L.L.; Coulter,
S.N.; Folger, K.R.; Kas, A.; Larbig, K.; Lim, R.M.; Smith, K.A.; Spencer, D.H.;
Wong, G.K.S.; Wu, Z.; Paulsen, I.T.; Reizer, J.; Saier, M.H.; Hancock, R.E.W.;
Lory, S.; Olson, M.V.
Nature 406, 959-964, 2000

A;Title: Complete genome sequence of Pseudomonas aeruginosa PAO1, an
opportunistic pathogen.

A;Reference number: A82950; MUID:20437337; PMID:10984043
A;Accession: C83475
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-141 <STO>

A;Cross-references: GB:AE004565; GB:AE004091; NID:g9947294; PIDN:AAG04743.1;
GSPDB:GN00131; PASP:PA1354

A;Experimental source: strain PAO1

C;Genetics:

A;Gene: PA1354

Query Match 68.3%; Score 28; DB 2; Length 141; 

Best Local Similarity 85.7%; Pred. No. 1.3e+02; 

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy          2 VFFAEDF 8
              || ||||
Db        112 VFEAEDF 118



RESULT 100 
S76052 

hypothetical protein - Synechocystis sp. (strain PCC 6803)
C; Species: Synechocystis sp. 
A;Variety: PCC 6803 

C;Date: 25-Apr-1997 #sequence_revision 25-Apr-1997 #text_change 20-Jun-2000 
C;Accession: S76052

R;Kaneko, T.; Sato, S.; Kotani, H.; Tanaka, A.; Asamizu, E.; Nakamura, Y.;
Miyajima, N.; Hirosawa, M.; Sugiura, M.; Sasamoto, S.; Kimura, T.; Hosouchi, T.;
Matsuno, A.; Muraki, A.; Nakazaki, N.; Naruo, K.; Okumura, S.; Shimpo, S.;
Takeuchi, C.; Wada, T.; Watanabe, A.; Yamada, M.; Yasuda, M.; Tabata, S.
DNA Res. 3, 109-136, 1996

A;Title: Sequence analysis of the genome of the unicellular cyanobacterium
Synechocystis sp. PCC6803. II. Sequence determination of the entire genome and
assignment of potential protein-coding regions.
A;Reference number: S74322; MUID:97061201; PMID:8905231
A;Accession: S76052
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-157 <KAN>

A;Cross-references: EMBL:D63999; GB:AB001339; NID:g1001396; PIDN:BAA10030.1;
PID:g1001408

A;Note: the nucleotide sequence was submitted to the EMBL Data Library, June
1996

C;Superfamily: protein-tyrosine-phosphatase, low molecular weight

Query Match 68.3%; Score 28; DB 2; Length 157; 

Best Local Similarity 83.3%; Pred. No. 1.4e+02; 

Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          3 FFAEDF 8
              || |||
Db         72 FFPEDF 77



Search completed: February 28, 2004, 08:56:50 
Job time : 41 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: February 28, 2004, 07:06:39 ; Search time 21.5 Seconds
(without alignments)
19.375 Million cell updates/sec



Title: US-09-668-314C-84 

Perfect score: 41 

Sequence: 1 LVFFAEDF 8 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters: 141681

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 1000 summaries

Database: SwissProt 42:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
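
The percentages reported for each hit can be checked by hand. The sketch below is an illustration only, not part of the GenCore output: it assumes that "Perfect score" is the query's self-alignment score under BLOSUM62, that "Query Match" is the hit score divided by that perfect score, and that "Best Local Similarity" is the number of matches divided by the number of alignment columns. These definitions are inferred from the figures in this listing, and the function names are hypothetical.

```python
# Diagonal (self-match) entries of the standard BLOSUM62 matrix.
BLOSUM62_DIAGONAL = {
    "A": 4, "R": 5, "N": 6, "D": 6, "C": 9, "Q": 5, "E": 5, "G": 6,
    "H": 8, "I": 4, "L": 4, "K": 5, "M": 5, "F": 6, "P": 7, "S": 4,
    "T": 5, "W": 11, "Y": 7, "V": 4,
}


def perfect_score(query):
    """Score of the query aligned against itself (assumed meaning of 'Perfect score')."""
    return sum(BLOSUM62_DIAGONAL[residue] for residue in query)


def query_match_percent(score, perfect):
    """Assumed definition: hit score as a percentage of the perfect score."""
    return round(100.0 * score / perfect, 1)


def best_local_similarity(matches, conservative, mismatches, indels):
    """Assumed definition: identical columns as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / columns, 1)


if __name__ == "__main__":
    perfect = perfect_score("LVFFAEDF")
    print("Perfect score:", perfect)
    for score in (35, 29, 28.5, 28):
        print(score, "->", query_match_percent(score, perfect), "%")
    # Counts taken from a gapped hit (8 matches, 5 indels) and an ungapped one.
    print(best_local_similarity(8, 0, 0, 5))
    print(best_local_similarity(5, 1, 2, 0))
```

Under these assumptions the query LVFFAEDF scores 41 against itself, which agrees with the "Perfect score: 41" line, and scores of 35, 29, 28.5 and 28 give the 85.4%, 70.7%, 69.5% and 68.3% Query Match values seen in the result entries.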
ALIGNMENTS



RESULT 1 
A4_URSMA

ID A4_URSMA STANDARD; PRT; 57 AA.

AC Q29149; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 30-MAY-2000 (Rel. 39, Last annotation update) 

DE Alzheimer's disease amyloid A4 protein homolog [Contains: Beta-amyloid

DE protein (Beta-APP) (A-beta)] (Fragment). 

GN APP. 

OS Ursus maritimus (Polar bear) (Thalarctos maritimus). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Carnivora; Fissipedia; Ursidae; Ursus. 

OX NCBI_TaxID=29073; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain;

RX MEDLINE=92017079; PubMed=1656157;

RA Johnstone E.M., Chaney M.O., Norris F.H., Pascual R., Little S.P.;

RT "Conservation of the sequence of the Alzheimer's disease amyloid

RT peptide in dog, polar bear and five other mammals by cross-species

RT polymerase chain reaction analysis.";

RL Brain Res. Mol. Brain Res. 10:299-305(1991).

CC -!- FUNCTION: Functional neuronal receptor which couples to 

CC intracellular signaling pathway through the GTP-binding protein 

CC G(O) (By similarity). 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- SIMILARITY: Belongs to the APP family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X56128; CAA39593.1; -.
DR PIR; B60045; B60045.
DR HSSP; P05067; 1BA4.
DR InterPro; IPR008155; A4_APP.
DR InterPro; IPR001255; Beta-APP.
DR Pfam; PF03494; Beta-APP; 1.
DR PROSITE; PS00319; A4_EXTRA; PARTIAL.
DR PROSITE; PS00320; A4_INTRA; PARTIAL.
KW Glycoprotein; Amyloid; Neurone; Transmembrane.
FT NON_TER       1      1
FT CHAIN         6     48  BETA-AMYLOID PROTEIN (POTENTIAL).
FT DOMAIN       <1     33  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM     34     57  POTENTIAL.
FT NON_TER      57     57
SQ SEQUENCE 57 AA; 6172 MW; 84209D88EBA82DFA CRC64;



Query Match 85.4%; Score 35; DB 1; Length 57; 

Best Local Similarity 100.0%; Pred. No. 0.73; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 LVFFAED 7
              |||||||
Db         22 LVFFAED 28



RESULT 2 
A4_CANFA 

ID A4_CANFA STANDARD; PRT; 58 AA. 

AC Q28280; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 30-MAY-2000 (Rel. 39, Last annotation update) 

DE Alzheimer's disease amyloid A4 protein homolog [Contains: Beta-amyloid 

DE protein (Beta-APP) (A-beta)] (Fragment).

GN APP.

OS Canis familiaris (Dog).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

OX NCBI_TaxID=9615; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Kidney; 

RX MEDLINE=92017079; PubMed=1656157;

RA Johnstone E.M., Chaney M.O., Norris F.H., Pascual R., Little S.P.; 

RT "Conservation of the sequence of the Alzheimer's disease amyloid 

RT peptide in dog, polar bear and five other mammals by cross-species 

RT polymerase chain reaction analysis."; 

RL Brain Res. Mol. Brain Res. 10:299-305(1991).

CC -!- FUNCTION: Functional neuronal receptor which couples to 

CC intracellular signaling pathway through the GTP-binding protein 

CC G(O) (By similarity). 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- SIMILARITY: Belongs to the APP family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 



CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; X56125; CAA39590.1; -.

DR HSSP; P05067; 1BA4.

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR001255; Beta-APP. 

DR Pfam; PF03494; Beta-APP; 1. 

DR PROSITE; PS00319; A4_EXTRA; PARTIAL. 

DR PROSITE; PS00320; A4_INTRA; PARTIAL.

KW Glycoprotein; Amyloid; Neurone; Transmembrane. 



FT NON_TER       1      1
FT CHAIN         7     49  BETA-AMYLOID PROTEIN (POTENTIAL).
FT DOMAIN       <1     34  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM     35     58  POTENTIAL.
FT NON_TER      58     58
SQ SEQUENCE 58 AA; 6285 MW; 8469D488A2E12DFA CRC64;



Query Match 85.4%; Score 35; DB 1; Length 58; 

Best Local Similarity 100.0%; Pred. No. 0.74; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy          1 LVFFAED 7
              |||||||
Db         23 LVFFAED 29



RESULT 3 
A4_RABIT 

ID A4_RABIT STANDARD; PRT; 58 AA. 

AC Q28748; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Alzheimer's disease amyloid A4 protein homolog [Contains: Beta-amyloid 

DE protein (Beta-APP) (A-beta)] (Fragment).

GN APP.

OS Oryctolagus cuniculus (Rabbit).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.

OX NCBI_TaxID=9986;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=92017079; PubMed=1656157;

RA Johnstone E.M., Chaney M.O., Norris F.H., Pascual R., Little S.P.;

RT "Conservation of the sequence of the Alzheimer's disease amyloid 

RT peptide in dog, polar bear and five other mammals by cross-species 

RT polymerase chain reaction analysis."; 

RL Brain Res. Mol. Brain Res. 10:299-305(1991). 

CC -!- FUNCTION: Functional neuronal receptor which couples to 

CC intracellular signaling pathway through the GTP-binding protein 

CC G(O) (By similarity). 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- SIMILARITY: Belongs to the APP family. 

CC 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X56129; CAA39594.1; -. 

DR HSSP; P05067; 1BA4. 

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR001255; Beta-APP. 

DR Pfam; PF03494; Beta-APP; 1. 

DR PROSITE; PS00319; A4_EXTRA; PARTIAL. 

DR PROSITE; PS00320; A4_INTRA; PARTIAL. 

KW Glycoprotein; Amyloid; Neurone; Transmembrane. 



FT NON_TER       1      1
FT CHAIN         6     48  BETA-AMYLOID PROTEIN (POTENTIAL).
FT DOMAIN       <1     33  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM     34     57  POTENTIAL.
FT DOMAIN       58    >58  CYTOPLASMIC (POTENTIAL).
FT NON_TER      58     58
SQ SEQUENCE 58 AA; 6300 MW; F434209D88EBA82D CRC64;

Query Match 85.4%; Score 35; DB 1; Length 58;

Best Local Similarity 100.0%; Pred. No. 0.74;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 LVFFAED 7
              |||||||
Db         22 LVFFAED 28



RESULT 4 
A4_SHEEP 

ID A4_SHEEP STANDARD; PRT; 58 AA. 

AC Q28757; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 30-MAY-2000 (Rel. 39, Last annotation update) 

DE Alzheimer's disease amyloid A4 protein homolog [Contains: Beta-amyloid 

DE protein (Beta-APP) (A-beta)] (Fragment).

GN APP.

OS Ovis aries (Sheep).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Caprinae; Ovis. 

OX NCBI_TaxID=9940; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Heart;

RX MEDLINE=92017079; PubMed=1656157;

RA Johnstone E.M., Chaney M.O., Norris F.H., Pascual R., Little S.P.;

RT "Conservation of the sequence of the Alzheimer's disease amyloid 

RT peptide in dog, polar bear and five other mammals by cross-species 

RT polymerase chain reaction analysis."; 

RL Brain Res. Mol. Brain Res. 10:299-305(1991).
CC -!- FUNCTION: Functional neuronal receptor which couples to
CC intracellular signaling pathway through the GTP-binding protein
CC G(O) (By similarity).
CC -!- SUBCELLULAR LOCATION: Type I membrane protein.
CC -!- SIMILARITY: Belongs to the APP family.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; X56130; CAA39595.1; -.



DR HSSP; P05067; 1BA4.
DR InterPro; IPR008155; A4_APP.
DR InterPro; IPR001255; Beta-APP.
DR Pfam; PF03494; Beta-APP; 1.
DR PROSITE; PS00319; A4_EXTRA; PARTIAL.
DR PROSITE; PS00320; A4_INTRA; PARTIAL.
KW Glycoprotein; Amyloid; Neurone; Transmembrane.
FT NON_TER       1      1
FT CHAIN         6     48  BETA-AMYLOID PROTEIN (POTENTIAL).
FT DOMAIN       <1     33  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM     34     57  POTENTIAL.
FT DOMAIN       58    >58  CYTOPLASMIC (POTENTIAL).
FT NON_TER      58     58
SQ SEQUENCE 58 AA; 6300 MW; F434209D88EBA82D CRC64;



Query Match 85.4%; Score 35; DB 1; Length 58;

Best Local Similarity 100.0%; Pred. No. 0.74;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 LVFFAED 7
              |||||||
Db         22 LVFFAED 28



RESULT 5
A4_BOVIN

ID A4_BOVIN STANDARD; PRT; 59 AA.

AC Q28053;

DT 01-NOV-1997 (Rel. 35, Created)

DT 01-NOV-1997 (Rel. 35, Last sequence update)

DT 30-MAY-2000 (Rel. 39, Last annotation update)

DE Alzheimer's disease amyloid A4 protein homolog [Contains: Beta-amyloid

DE protein (Beta-APP) (A-beta)] (Fragment).

GN APP.

OS Bos taurus (Bovine).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;

OC Bovidae; Bovinae; Bos.

OX NCBI_TaxID=9913;

RN [1]

RP SEQUENCE FROM N.A.

RC TISSUE=Brain;





RX MEDLINE=92017079; PubMed=1656157;

RA Johnstone E.M., Chaney M.O., Norris F.H., Pascual R., Little S.P.;

RT "Conservation of the sequence of the Alzheimer's disease amyloid

RT peptide in dog, polar bear and five other mammals by cross-species

RT polymerase chain reaction analysis.";

RL Brain Res. Mol. Brain Res. 10:299-305(1991).

CC -!- FUNCTION: Functional neuronal receptor which couples to

CC intracellular signaling pathway through the GTP-binding protein

CC G(O) (By similarity).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- SIMILARITY: Belongs to the APP family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; X56124; CAA39589.1; -.

DR EMBL; X56126; CAA39591.1; -.

DR HSSP; P05067; 1BA4.

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR001255; Beta-APP. 

DR Pfam; PF03494; Beta-APP; 1. 

DR PROSITE; PS00319; A4_EXTRA; PARTIAL. 

DR PROSITE; PS00320; A4_INTRA; PARTIAL.

KW Glycoprotein; Amyloid; Neurone; Transmembrane. 



FT NON_TER 1 1

FT CHAIN 7 49 BETA-AMYLOID PROTEIN (POTENTIAL).

FT DOMAIN <1 34 EXTRACELLULAR (POTENTIAL).

FT TRANSMEM 35 58 POTENTIAL.

FT DOMAIN 59 >59 CYTOPLASMIC (POTENTIAL).

FT NON_TER 59 59

SQ SEQUENCE 59 AA; 6414 MW; F43469D488A2E12D CRC64;


Query Match 85.4%; Score 35; DB 1; Length 59;

Best Local Similarity 100.0%; Pred. No. 0.76;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 LVFFAED 7 

     |||||||

Db 23 LVFFAED 29 



RESULT 6 
A4_SAISC 

ID A4_SAISC STANDARD; PRT; 751 AA. 

AC Q95241; 

DT 15-DEC-1998 (Rel. 37, Created) 

DT 15-DEC-1998 (Rel. 37, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Amyloid beta A4 protein precursor (APP) (Alzheimer's disease amyloid 
DE protein homolog) [Contains: Soluble APP-alpha (S-APP-alpha); Soluble
DE APP-beta (S-APP-beta); C99; Beta-amyloid protein 42 (Beta-APP42);
DE Beta-amyloid protein 40 (Beta-APP40); C83; P3(42); P3(40); Gamma-

DE CTF(59) (Gamma-secretase C-terminal fragment 59); Gamma-CTF(57)

DE (Gamma-secretase C-terminal fragment 57); Gamma-CTF(50) (Gamma-

DE secretase C-terminal fragment 50); C31].

GN APP. 

OS Saimiri sciureus (Common squirrel monkey).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Platyrrhini; Cebidae; Cebinae; Saimiri. 

OX NCBI_TaxID=9521; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Kidney, and Liver; 

RX MEDLINE=96108492; PubMed=8532114;

RA Levy E., Amorim A., Frangione B., Walker L.C.; 

RT "Beta-amyloid precursor protein gene in squirrel monkeys with 

RT cerebral amyloid angiopathy."; 

RL Neurobiol. Aging 16:805-808(1995). 

CC -!- FUNCTION: Functions as a cell surface receptor and performs 

CC physiological functions on the surface of neurons relevant to 

CC neurite growth, neuronal adhesion and axonogenesis. Involved in

CC cell mobility and transcription regulation through protein-protein

CC interactions (By similarity). Can promote transcription activation

CC through binding to APBB1/Tip60 and inhibit Notch signaling through

CC interaction with Numb (By similarity). Couples to apoptosis-

CC inducing pathways such as those mediated by G(O) and JIP (By

CC similarity). Inhibits G(O) alpha ATPase activity (By similarity).

CC Acts as a kinesin I membrane receptor, mediating the axonal

CC transport of beta-secretase and presenilin 1 (By similarity). May

CC be involved in copper homeostasis/oxidative stress through copper

CC ion reduction. In vitro, copper-metallated APP induces neuronal

CC death directly or is potentiated through Cu(II)-mediated low-

CC density lipoprotein oxidation (By similarity). Can regulate

CC neurite outgrowth through binding to components of the

CC extracellular matrix such as heparin and collagen I and IV (By

CC similarity). The splice isoforms that contain the BPTI domain

CC possess protease inhibitor activity (By similarity).

CC -!- FUNCTION: Beta-amyloid peptides are lipophilic metal chelators 

CC with metal-reducing activity. Bind transient metals such as 

CC copper, zinc and iron (By similarity).

CC -!- FUNCTION: The gamma-CTF peptides as well as the caspase-cleaved

CC peptides, including C31, are potent enhancers of neuronal

CC apoptosis (By similarity).

CC -!- SUBUNIT: Binds, via its C-terminal, to the PID domain of several

CC cytoplasmic proteins, including APBB family members, the APBA

CC family, MAPK8IP1, and SHC1, Numb and Dab1 (By similarity). Binding

CC to Dab1 inhibits its serine phosphorylation (By similarity). Also

CC interacts with GPCR-like protein BPP, FPRL1, APPBP1, IB1, KNS2

CC (via its TPR domains) (By similarity), APPBP2 (via BaSS) and DDB1.

CC In vitro, it binds MAPT via the MT-binding domains (By

CC similarity). Associates with microtubules in the presence of ATP

CC and in a kinesin-dependent manner (By similarity).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. Cell surface 

CC protein that rapidly becomes internalized via clathrin-coated 

CC pits. During maturation, the immature APP (N-glycosylated in the 

CC endoplasmic reticulum) moves to the Golgi complex where complete 

CC maturation occurs (O-glycosylated and sulfated). After alpha-

CC secretase cleavage, soluble APP is released into the extracellular

CC space and the C-terminal is internalized to endosomes and

CC lysosomes. Some APP accumulates in secretory transport vesicles

CC leaving the late Golgi compartment and returns to the cell

CC surface. Gamma-CTF(59) peptide is located to both the cytoplasm

CC and nuclei of neurons (By similarity).

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Comment=Additional isoforms seem to exist;

CC Name=APP770;

CC IsoId=Q95241-1; Sequence=Displayed;

CC Name=APP695;

CC IsoId=Q95241-2; Sequence=Not described;

CC -!- DOMAIN: The basolateral sorting signal (BaSS) is required for 

CC sorting of membrane proteins to the basolateral surface of 

CC epithelial cells (By similarity).

CC -!- DOMAIN: The NPXY sequence motif found in many tyrosine- 

CC phosphorylated proteins is required for the specific binding of 

CC the PID domain. However additional amino acids either N- or C- 

CC terminal to the NPXY motif are often required for complete 

CC interaction. The PID domain-containing proteins which bind APP 

CC require the YENPTY motif for full interaction. These interactions 

CC are independent of phosphorylation on the terminal tyrosine 

CC residue. The NPXY site is also involved in clathrin-mediated 

CC endocytosis (By similarity).

CC -!- PTM: Proteolytically processed under normal cellular conditions. 

CC Cleavage by alpha-secretase or alternatively by beta-secretase 

CC leads to generation and extracellular release of soluble APP 

CC peptides, S-APP-alpha and S-APP-beta, respectively, and the

CC retention of corresponding membrane-anchored C-terminal fragments,

CC C83 and C99. Subsequent processing of C83 by gamma-secretase

CC yields P3 peptides. This is the major secretory pathway and is

CC nonamyloidogenic. Alternatively, presenilin/nicastrin-mediated

CC gamma-secretase processing of C99 releases the amyloid beta

CC proteins, amyloid-beta 40 (Abeta40) and amyloid-beta 42 (Abeta42),

CC major components of amyloid plaques, and the cytotoxic C-terminal

CC fragments, gamma-CTF(50), gamma-CTF(57) and gamma-CTF(59) (By

CC similarity).

CC -!- PTM: Proteolytically cleaved by caspases during neuronal apoptosis

CC (By similarity). Cleavage at Asp-720 by either caspase-3, -8 or -9

CC results in the production of the neurotoxic C31 peptide and the

CC increased production of beta-amyloid peptides (By similarity).

CC -!- PTM: N- and O-glycosylated (By similarity). 

CC -!- PTM: Phosphorylation in the C-terminal on tyrosine, threonine and 

CC serine residues is neuron-specific. Phosphorylation can affect APP 

CC processing, neuronal differentiation and interaction with other 

CC proteins (By similarity).

CC -!- MISCELLANEOUS: Chelation of metal ions, notably copper, iron and

CC zinc, can induce histidine-bridging between beta-amyloid molecules

CC resulting in beta-amyloid-metal aggregates (By similarity).

CC Extracellular zinc-binding increases binding of heparin to APP and

CC inhibits collagen-binding (By similarity).

CC -!- SIMILARITY: Belongs to the APP family. 

CC -!- SIMILARITY: Contains 1 BPTI/Kunitz inhibitor domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

Kunitz_BPTI; 1.

SMART; SM00006; A4_EXTRA; 1.

1.

PROSITE;

PROSITE; PS00280;

PROSITE; PS50279; BPTI_KUNITZ_2; 1.

FT SITE 652 653 CLEAVAGE (BY BETA-SECRETASE) (BY SIMILARITY).

FT SITE 653 CLEAVAGE (BY CASPASE-6) (BY SIMILARITY).

FT SITE 668 CLEAVAGE (BY ALPHA-SECRETASE) (BY SIMILARITY).

FT SITE 685 INVOLVED IN FREE RADICAL PROPAGATION (BY SIMILARITY).

FT SITE 687 INVOLVED IN OXIDATIVE REACTIONS (BY SIMILARITY).

FT SITE 692 CLEAVAGE (BY GAMMA-SECRETASE; SITE 1) (BY SIMILARITY).

FT SITE 694 CLEAVAGE (BY GAMMA-SECRETASE; SITE 2) (BY SIMILARITY).

FT SITE 701 CLEAVAGE (BY GAMMA-SECRETASE; SITE 3) (BY SIMILARITY).

FT SITE 705 BASOLATERAL SORTING SIGNAL (BY SIMILARITY).

FT SITE 720 CLEAVAGE (BY CASPASES-3, -6, -8 OR -9) (BY SIMILARITY).

FT SITE 738 ENDOCYTOSIS SIGNAL.

FT SITE 740 NPXY MOTIF.



Query Match 85.4%; Score 35; DB 1; Length 751; 

Best Local Similarity 100.0%; Pred. No. 9.6; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps

RESULT 7
A4_CAVPO 

ID A4_CAVPO STANDARD; PRT; 770 AA. 

AC Q60495; Q60496; 

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Amyloid beta A4 protein precursor (APP) (ABPP) (Alzheimer's disease

DE amyloid protein homolog) [Contains: Soluble APP-alpha (S-APP-alpha);

DE Soluble APP-beta (S-APP-beta); CTF-alpha; CTF-beta; Beta-amyloid

DE protein 42 (Beta-APP42); Beta-amyloid protein 40 (Beta-APP40); P3(42);

DE P3(40); CTF(59) (Gamma-secretase C-terminal fragment 59); Gamma-

DE CTF(57) (Gamma-secretase C-terminal fragment 57); C31].

GN APP.

OS Cavia porcellus (Guinea pig).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Hystricognathi; Caviidae; Cavia.

OX NCBI_TaxID=10141; 

RN [1] 

RP SEQUENCE FROM N.A., AND ALTERNATIVE SPLICING.

RC TISSUE=Brain, and Liver;

RX MEDLINE=97236426; PubMed=9116031;

RA Beck M., Mueller D., Bigl V.;

RT "Amyloid precursor protein in Guinea pigs - complete cDNA sequence and

RT alternative splicing.";

RL Biochim. Biophys. Acta 1351:17-21(1997).

RN [2]

RP INTERACTION OF BETA-APP40 WITH APOE.

RX MEDLINE=98007700; PubMed=9349544;

RA Martel C.L., Mackic J.B., Matsubara E., Governale S., Miguel C.,

RA Miao W., McComb J.G., Frangione B., Ghiso J., Zlokovic B.V.;

RT "Isoform-specific effects of apolipoproteins E2, E3, and E4 on

RT cerebral capillary sequestration and blood-brain barrier transport of

RT circulating Alzheimer's amyloid beta.";

RL J. Neurochem. 69:1995-2004(1997).

RN [3]

RP PROCESSING.

RX MEDLINE=20084499; PubMed=10619481;

RA Beck M., Brueckner M.K., Holzer M., Kaap S., Pannicke T., Arendt T.,

RA Bigl V.;

RT "Guinea-pig primary cell cultures provide a model to study expression

RT and amyloidogenic processing of endogenous amyloid precursor

RT protein.";

RL Neuroscience 95:243-254(2000).

RN [4]

RP GAMMA-SECRETASE PROCESSING.

RX MEDLINE=20576391; PubMed=11035007;

RA Pinnix I., Musunuru U., Tun H., Sridharan A., Golde T., Eckman C.,

RA Ziani-Cherif C., Onstead L., Sambamurti K.;

RT "A novel gamma-secretase assay based on detection of the putative

RT C-terminal fragment-gamma of amyloid beta protein precursor.";

RL J. Biol. Chem. 276:481-487(2001).

CC -!- FUNCTION: Functions as a cell surface receptor and performs 

CC physiological functions on the surface of neurons relevant to 

CC neurite growth, neuronal adhesion and axonogenesis. Involved in

CC cell mobility and transcription regulation through protein-protein

CC interactions (By similarity). Can promote transcription activation

CC through binding to APBB1/Tip60 and inhibit Notch signaling through

CC interaction with Numb (By similarity). Couples to apoptosis-

CC inducing pathways such as those mediated by G(O) and JIP (By

CC similarity). Inhibits G(O) alpha ATPase activity (By similarity).

CC Acts as a kinesin I membrane receptor, mediating the axonal

CC transport of beta-secretase and presenilin 1 (By similarity). May

CC be involved in copper homeostasis/oxidative stress through copper

CC ion reduction (By similarity). In vitro, copper-metallated APP

CC induces neuronal death directly or is potentiated through Cu(II)-

CC mediated low-density lipoprotein oxidation (By similarity). Can

CC regulate neurite outgrowth through binding to components of the

CC extracellular matrix such as heparin and collagen I and IV (By

CC similarity). The splice isoforms that contain the BPTI domain

CC possess protease inhibitor activity (By similarity).

CC -!- FUNCTION: Beta-amyloid peptides are lipophilic metal chelators 

CC with metal-reducing activity. Bind transient metals such as 

CC copper, zinc and iron. Beta-amyloid peptides bind to lipoproteins 

CC and apolipoproteins E and J in the CSF and to HDL particles in

CC plasma, inhibiting metal-catalyzed oxidation of lipoproteins. 

CC -!- FUNCTION: Appicans elicit adhesion of neural cells to the 

CC extracellular matrix and may regulate neurite outgrowth in the 

CC brain (By similarity).

CC -!- FUNCTION: The gamma-CTF peptides as well as the caspase-cleaved

CC peptides, including C31, are potent enhancers of neuronal

CC apoptosis (By similarity).

CC -!- SUBUNIT: Binds, via its C-terminal, to the PID domain of several

CC cytoplasmic proteins, including APBB family members, the APBA

CC family, MAPK8IP1, SHC1 and Numb and Dab1 (By similarity). Also

CC interacts with GPCR-like protein BPP, FPRL1, APPBP1, IB1, KNS2

CC (via its TPR domains), APPBP2 (via BaSS) and DDB1 (By similarity).

CC Associates with microtubules in the presence of ATP and in a

CC kinesin-dependent manner (By similarity). Soluble Abeta40 binds

CC all three isoforms of APOE, in vitro and in vivo. When lipidated,

CC ApoE3 appears to be the preferred amyloid binding isoform, while

CC the apoE4 isoform-beta-APP40 complex is capable of being

CC transported across the blood-brain barrier. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. Cell surface 

CC protein that rapidly becomes internalized via clathrin-coated pits 

CC (By similarity). During maturation, the immature APP (N-

CC glycosylated in the endoplasmic reticulum) moves to the Golgi

CC complex where complete maturation occurs (O-glycosylated and

CC sulfated) (By similarity). After alpha-secretase cleavage, soluble

CC APP is released into the extracellular space and the C-terminal is

CC internalized to endosomes and lysosomes (By similarity). Some APP

CC accumulates in secretory transport vesicles leaving the late Golgi

CC compartment and returns to the cell surface (By similarity). APP

CC sorts to the basolateral surface in epithelial cells (By

CC similarity).

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Comment=Additional isoforms, missing exons 7, 8 and 15, seem to

CC exist. The L-isoforms, missing exon 15, are referred to as

CC appicans;

CC Name=APP770;

CC IsoId=Q60495-1; Sequence=Displayed;

CC Name=APP695; 

CC IsoId=Q60495-2; Sequence=VSP_007221, VSP_007222; 

CC -!- TISSUE SPECIFICITY: Isoform APP695 is the major isoform found in 

CC brain. The longer isoforms containing the BPTI domain are 

CC predominantly expressed in peripheral organs such as muscle and 

CC liver. 

CC -!- INDUCTION: Increased levels during neuronal differentiation. 
CC -!- DOMAIN: The basolateral sorting signal (BaSS) is required for 

CC sorting of membrane proteins to the basolateral surface of 

CC epithelial cells. 

CC -!- DOMAIN: The NPXY sequence motif found in many tyrosine-

CC phosphorylated proteins is required for the specific binding of

CC the PID domain. However additional amino acids either N- or C-

CC terminal to the NPXY motif are often required for complete

CC interaction. The PID domain-containing proteins which bind APP

CC require the YENPTY motif for full interaction. These interactions

CC are independent of phosphorylation on the terminal tyrosine

CC residue (By similarity). The NPXY site is also involved in

CC clathrin-mediated endocytosis.

CC -!- PTM: Proteolytically processed under normal cellular conditions. 

CC Cleavage by alpha-secretase or alternatively by beta-secretase 

CC leads to generation and extracellular release of soluble APP 

CC peptides, S-APP-alpha and S-APP-beta, respectively, and the 

CC retention of corresponding membrane-anchored C-terminal fragments, 

CC CTF-alpha and CTF-beta. Subsequent processing of CTF-alpha by 

CC gamma-secretase yields P3 peptides. This is the major secretory 

CC pathway and is nonamyloidogenic. Alternatively,

CC presenilin/nicastrin-mediated gamma-secretase processing of CTF-

CC beta releases the amyloid beta proteins, amyloid-beta 40 (Abeta40)

CC and amyloid-beta 42 (Abeta42), major components of amyloid

CC plaques, and the corresponding cytotoxic C-terminal fragments

CC (CTFs).

CC -!- PTM: Proteolytically cleaved by caspase-3 during neuronal

CC apoptosis (By similarity).

CC -!- PTM: N- and O-glycosylated. O-linkage of chondroitin sulfate to

CC the L-APP isoforms produces the APP proteoglycan core proteins,

CC the appicans (By similarity).

CC -!- PTM: Phosphorylation in the C-terminal on tyrosine, threonine and

CC serine residues is neuron-specific (By similarity).

CC Phosphorylation can affect APP processing, neuronal

CC differentiation and interaction with other proteins.

CC -!- PTM: Extracellular binding and reduction of copper, results in a

CC corresponding oxidation of Cys-144 and Cys-158, and the formation

CC of a disulfide bond (By similarity).

CC -!- MISCELLANEOUS: Chelation of metal ions, notably copper, iron and 

CC zinc, can induce histidine-bridging between beta-amyloid molecules 

CC resulting in beta-amyloid-metal aggregates. 

CC -!- SIMILARITY: Belongs to the APP family. 

CC -!- SIMILARITY: Contains 1 BPTI/Kunitz inhibitor domain. 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X97631; CAA66230.1; -. 

DR EMBL; X99198; CAA67589.1; -. 

DR HSSP; P05067; 1BA4.

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR008154; A4_extra. 

DR InterPro; IPR002223; Kunitz_BPTI. 

DR Pfam; PF00014; Kunitz_BPTI; 1. 

DR PRINTS; PR00203; AMYLOIDA4.

DR PRINTS; PR00759; BASICPTASE. 

DR ProDom; PD000222; Kunitz_BPTI; 1. 

DR SMART; SM00006; A4_EXTRA; 1. 

DR SMART; SM00131; KU; 1. 

DR PROSITE; PS00319; A4_EXTRA; 1. 

DR PROSITE; PS00320; A4_INTRA; 1. 

DR PROSITE; PS00280; BPTI_KUNITZ_1; 1.

DR PROSITE; PS50279; BPTI_KUNITZ_2; 1.

KW Apoptosis; Endocytosis; Cell adhesion; Serine protease inhibitor; 

KW Coated pits; Neurone; Heparin-binding; Metal-binding; Copper; Iron; 

KW Zinc; Signal; Transmembrane; Glycoprotein; Phosphorylation; 

KW Proteoglycan; Alternative splicing; Amyloid. 

FT SIGNAL 1 17 BY SIMILARITY. 

FT CHAIN 18 770 AMYLOID BETA A4 PROTEIN. 

FT CHAIN 18 687 SOLUBLE APP-ALPHA (BY SIMILARITY). 

FT CHAIN 18 671 SOLUBLE APP-BETA (BY SIMILARITY).

FT CHAIN 672 770 CTF-ALPHA (BY SIMILARITY).

FT CHAIN 672 713 BETA-AMYLOID PROTEIN 42 (BY SIMILARITY).

FT CHAIN 672 711 BETA-AMYLOID PROTEIN 40 (BY SIMILARITY).

FT CHAIN 688 770 CTF-BETA (BY SIMILARITY).



FT CHAIN 688 713 P3(42) (BY SIMILARITY).

FT CHAIN 688 711 P3(40) (BY SIMILARITY).

FT CHAIN 712 770 GAMMA-CTF(59) (BY SIMILARITY)

FT CHAIN 714 770 GAMMA-CTF(57) (BY SIMILARITY)

Query Match 85.4%; Score 35; DB 1; Length 770;

Best Local Similarity 100.0%; Pred. No. 9.8;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

RESULT 8
A4_HUMAN 

ID A4_HUMAN STANDARD; PRT; 770 AA. 

AC P05067; P09000; P78438; Q13764; Q13778; Q13793; Q16011; Q16014; 

AC Q16019; Q16020; Q9BT38; Q9UCA9; Q9UCB6; Q9UCC8; Q9UCD1; Q9UQ58;

DT 13-AUG-1987 (Rel. 05, Created) 

DT 01-NOV-1991 (Rel. 20, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Amyloid beta A4 protein precursor (APP) (ABPP) (Alzheimer's disease

DE amyloid protein) (Cerebral vascular amyloid peptide) (CVAP) (Protease

DE nexin-II) (PN-II) (APPI) (PreA4) [Contains: Soluble APP-alpha (S-APP-

DE alpha); Soluble APP-beta (S-APP-beta); C99; Beta-amyloid protein 42

DE (Beta-APP42); Beta-amyloid protein 40 (Beta-APP40); C83; P3(42);

DE P3(40); Gamma-CTF(59) (Gamma-secretase C-terminal fragment 59)

DE (Amyloid intracellular domain 59) (AID(59)); Gamma-CTF(57) (Gamma-

DE secretase C-terminal fragment 57) (Amyloid intracellular domain 57)

DE (AID(57)); Gamma-CTF(50) (Gamma-secretase C-terminal fragment 50)

DE (Amyloid intracellular domain 50) (AID(50)); C31].

GN APP OR A4 OR AD1.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM APP695).

RC TISSUE=Brain;

RX MEDLINE=87144572; PubMed=2881207;

RA Kang J., Lemaire H.-G., Unterbeck A., Salbaum J.M., Masters C.L.,

RA Grzeschik K.-H., Multhaup G., Beyreuther K., Mueller-Hill B.;

RT "The precursor of Alzheimer's disease amyloid A4 protein resembles a 

RT cell-surface receptor."; 

RL Nature 325:733-736(1987). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM APP751).

RC TISSUE=Brain;

RX MEDLINE=88122639; PubMed=2893289;

RA Ponte P., Gonzalez-Dewhitt P., Schilling J., Miller J., Hsu D.,

RA Greenberg B., Davis K., Wallace W., Lieberburg I., Fuller F.,

RA Cordell B.;

RT "A new A4 amyloid mRNA contains a domain homologous to serine 

RT proteinase inhibitors."; 

RL Nature 331:525-527(1988). 

RN [3] 



RP SEQUENCE FROM N.A. (ISOFORM APP695).

RX MEDLINE=89128427; PubMed=2783775;

RA Lemaire H.-G., Salbaum J.M., Multhaup G., Kang J., Bayney R.M.,

RA Unterbeck A., Beyreuther K., Mueller-Hill B.;

RT "The PreA4(695) precursor protein of Alzheimer's disease A4 amyloid

RT is encoded by 16 exons.";

RL Nucleic Acids Res. 17:517-522(1989).

RN [4] 

RP SEQUENCE FROM N.A. (ISOFORM APP770).

RX MEDLINE=90236318; PubMed=2110105;

RA Yoshikai S.-I., Sasaki H., Doh-Ura K., Furuya H., Sakaki Y.;

RT "Genomic organization of the human amyloid beta-protein precursor

RT gene.";

RL Gene 87:257-263(1990). 

RN [5] 

RP ERRATUM, AND REVISIONS. 

RA Yoshikai S.-I., Sasaki H., Doh-ura K., Furuya H., Sakaki Y.;

RL Gene 102:291-292(1991). 

RN [6] 

RP SEQUENCE FROM N.A. (ISOFORM L-APP733).

RC TISSUE=Leukocyte;

RX MEDLINE=92268136; PubMed=1587857;

RA Koenig G., Moenning U., Czech C., Prior R., Banati R.,

RA Schreiter-Gasser U., Bauer J., Masters C.L., Beyreuther K.;

RT "Identification and differential expression of a novel alternative 

RT splice isoform of the beta A4 amyloid precursor protein (APP) mRNA in 

RT leukocytes and brain microglial cells."; 

RL J. Biol. Chem. 267:10804-10809(1992). 

RN [7] 

RP SEQUENCE FROM N.A. (ISOFORM APP770). 

RX MEDLINE=97263807; PubMed=9108164;

RA Hattori M., Tsukahara F., Furuhata Y., Tanahashi H., Hirose M.,

RA Saito M., Tsukuni S., Sakaki Y.;

RT "A novel method for making nested deletions and its application for 

RT sequencing of a 300 kb region of human APP locus."; 

RL Nucleic Acids Res. 25:1802-1808(1997). 

RN [8] 

RP SEQUENCE FROM N.A. (ISOFORM APP639). 

RC TISSUE=Brain; 

RX MEDLINE=22744650; PubMed=12859342;

RA Tang K., Wang C., Shen C., Sheng S., Ravid R., Jing N.;

RT "Identification of a novel alternative splicing isoform of human 

RT amyloid precursor protein gene, APP639."; 

RL Eur. J. Neurosci. 18:102-108(2003). 

RN [9] 

RP SEQUENCE FROM N.A. (ISOFORM APP305). 

RC TISSUE=Pancreas;

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [10] 

RP SEQUENCE OF 1-10 FROM N.A. 

RC TISSUE=Liver;

RX MEDLINE=89016647; PubMed=3140222;

RA Schon E.A., Mita S., Sadlock J., Herbert J.; 

RT "A cDNA specifying the human amyloid beta precursor protein (ABPP) 

RT encodes a 95-kDa polypeptide."; 

RL Nucleic Acids Res. 16:9351-9351(1988). 

RN [11] 

RP ERRATUM, AND REVISIONS. 

RA Mita S., Sadlock J., Herbert J., Schon E.A.; 

RL Nucleic Acids Res. 16:11402-11402(1988). 

RN [12] 

RP SEQUENCE OF 1-75 FROM N.A. 

RX MEDLINE=89165870; PubMed=2538123;

RA La Fauci G., Lahiri D.K., Salton S.R., Robakis N.K.;

RT "Characterization of the 5'-end region and the first two exons of the

RT beta-protein precursor gene.";

RL Biochem. Biophys. Res. Commun. 159:297-304(1989).

RN [13] 

RP SEQUENCE OF 18-50. 

RC TISSUE=Fibroblast; 

RX MEDLINE=87250462; PubMed=3597385; 

RA van Nostrand W.E., Cunningham D.D.; 

RT "Purification of protease nexin II from human fibroblasts."; 

RL J. Biol. Chem. 262:8508-8514(1987). 

RN [14] 

RP PARTIAL SEQUENCE FROM N.A. (ISOFORM APP751).

RC TISSUE=Brain; 

RX MEDLINE=89346754; PubMed=2569763; 

RA de Sauvage F., Octave J.N.; 

RT "A novel mRNA of the A4 amyloid precursor gene coding for a possibly 

RT secreted protein."; 

RL Science 245:651-653(1989). 

RN [15] 

RP PARTIAL SEQUENCE FROM N.A. (ISOFORM APP695).

RC TISSUE=Brain;

RX MEDLINE=87231971; PubMed=3035574;

RA Robakis N.K., Ramakrishna N., Wolfe G., Wisniewski H.M.; 

RT "Molecular cloning and characterization of a cDNA encoding the 

RT cerebrovascular and the neuritic plaque amyloid peptides."; 

RL Proc. Natl. Acad. Sci. U.S.A. 84:4190-4194(1987). 

RN [16] 

RP SEQUENCE OF 286-366 FROM N.A. 

RX MEDLINE=88122640; PubMed=2893290;

RA Tanzi R.E., McClatchey A.I., Lamperti E.D., Villa-Komaroff L.,

RA Gusella J.F., Neve R.L.;

RT "Protease inhibitor domain encoded by an amyloid protein precursor 

RT mRNA associated with Alzheimer's disease."; 

RL Nature 331:528-530(1988). 

RN [17] 

RP SEQUENCE OF 287-367 FROM N.A. 

RX MEDLINE=88122641; PubMed=2893291;

RA Kitaguchi N., Takahashi Y., Tokushima Y., Shiojiri S., Ito H.;

RT "Novel precursor of Alzheimer's disease amyloid protein shows 

RT protease inhibitory activity."; 

RL Nature 331:530-532(1988). 

RN [18] 

RP SEQUENCE OF 507-770 FROM N.A. 

RC TISSUE=Brain cortex; 

RX MEDLINE=88124954; PubMed=2893379;

RA Zain S.B., Salim M., Chou W.G., Sajdel-Sulkowska E.M., Majocha R.E.,

RA Marotta C.A.;

RT "Molecular cloning of amyloid cDNA derived from mRNA of the Alzheimer 

RT disease brain: coding and noncoding regions of the fetal precursor 

RT mRNA are expressed in the cortex."; 

RL Proc. Natl. Acad. Sci. U.S.A. 85:929-933(1988). 

RN [19] 

RP SEQUENCE OF 523-555, AND COLLAGEN-BINDING DOMAIN. 

RX MEDLINE=96139497; PubMed=8576160;

RA Beher D., Hesse L., Masters C.L., Multhaup G.;

RT "Regulation of amyloid protein precursor (APP) binding to collagen and 

RT mapping of the binding sites on APP and collagen type I."; 

RL J. Biol. Chem. 271:1613-1620(1996). 

RN [20] 

RP SEQUENCE OF 655-737 FROM N.A., AND VARIANTS AD PHE-717; AD ILE-717

RP AND AD GLY-717.

RX MEDLINE=93236601; PubMed=8476439;

RA Denman R.B., Rosenzcwaig R., Miller D.L.;

RT "A system for studying the effect(s) of familial Alzheimer disease

RT mutations on the processing of the beta-amyloid peptide precursor.";

RL Biochem. Biophys. Res. Commun. 192:96-103(1993).

RN [21] 

RP SEQUENCE OF 656-737 FROM N.A. 

RX MEDLINE=89392030; PubMed=2675837;

RA Johnstone E.M., Chaney M.O., Moore R.E., Ward K.E., Norris F.H.,

RA Little S.P.;

RT "Alzheimer's disease amyloid peptide is encoded by two exons and shows

RT similarity to soybean trypsin inhibitor."; 

RL Biochem. Biophys. Res. Commun. 163:1248-1255(1989). 

RN [22] 

Query Match 85.4%; Score 35; DB 1; Length 770; 

Best Local Similarity 100.0%; Pred. No. 9.8; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 LVFFAED 7 

       |||||||

Db 688 LVFFAED 694 



RESULT 9 
A4_MACFA

ID A4_MACFA STANDARD; PRT; 770 AA.

AC P53601; Q95KN7; 

DT 01-OCT-1996 (Rel. 34, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Amyloid beta A4 protein precursor (APP) (ABPP) (Alzheimer's disease 

DE amyloid protein homolog) [Contains: Soluble APP-alpha (S-APP-alpha);

DE Soluble APP-beta (S-APP-beta); C99; Beta-amyloid protein 42 (Beta-

DE APP42); Beta-amyloid protein 40 (Beta-APP40); C83; P3(42); P3(40);

DE Gamma-CTF(59) (Gamma-secretase C-terminal fragment 59); Gamma-CTF(57)

DE (Gamma-secretase C-terminal fragment 57); Gamma-CTF(50) (Gamma-

DE secretase C-terminal fragment 50); C31].

GN APP.

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae;

OC Cercopithecinae; Macaca.

OX NCBI_TaxID=9541;

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS APP695 AND APP770).

RC TISSUE=Cerebellum;

RX MEDLINE=91273117; PubMed=1905108;

RA Podlisny M.B., Tolan D.R., Selkoe D.J.;

RT "Homology of the amyloid beta protein precursor in monkey and human

RT supports a primate model for beta amyloidosis in Alzheimer's

RT disease."; 

RL Am. J. Pathol. 138:1423-1435(1991). 

CC -!- FUNCTION: Functions as a cell surface receptor and performs 

CC physiological functions on the surface of neurons relevant to 

CC neurite growth, neuronal adhesion and axonogenesis. Involved in

CC cell mobility and transcription regulation through protein-protein

CC interactions (By similarity). Can promote transcription activation

CC through binding to APBB1/Tip60 and inhibit Notch signaling through

CC interaction with Numb (By similarity). Couples to apoptosis-

CC inducing pathways such as those mediated by G(O) and JIP (By

CC similarity). Inhibits G(O) alpha ATPase activity (By similarity).

CC Acts as a kinesin I membrane receptor, mediating the axonal

CC transport of beta-secretase and presenilin 1 (By similarity). May

CC be involved in copper homeostasis/oxidative stress through copper 

CC ion reduction. In vitro, copper-metallated APP induces neuronal 

CC death directly or is potentiated through Cu(II)-mediated low-

CC density lipoprotein oxidation (By similarity). Can regulate

CC neurite outgrowth through binding to components of the

CC extracellular matrix such as heparin and collagen I and IV (By

CC similarity). The splice isoforms that contain the BPTI domain

CC possess protease inhibitor activity (By similarity).

CC -!- FUNCTION: Beta-amyloid peptides are lipophilic metal chelators 

CC with metal-reducing activity. Bind transient metals such as 

CC copper, zinc and iron (By similarity).

CC -!- FUNCTION: The gamma-CTF peptides as well as the caspase-cleaved 

CC peptides, including C31, are potent enhancers of neuronal 

CC apoptosis (By similarity).

CC -!- SUBUNIT: Binds, via its C-terminal, to the PID domain of several 

CC cytoplasmic proteins, including APBB family members, the APBA 

CC family, MAPK8IP1, and SHC1, Numb and Dab1 (By similarity). Binding

CC to Dab1 inhibits its serine phosphorylation (By similarity). Also

CC interacts with GPCR-like protein BPP, FPRL1, APPBP1, IB1, KNS2



CC (via its TPR domains) (By similarity), APPBP2 (via BaSS) and DDB1. 

CC In vitro, it binds MAPT via the MT-binding domains (By 

CC similarity). Associates with microtubules in the presence of ATP

CC and in a kinesin-dependent manner (By similarity).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. Cell surface 

CC protein that rapidly becomes internalized via clathrin-coated 

CC pits. During maturation, the immature APP (N-glycosylated in the 

CC endoplasmic reticulum) moves to the Golgi complex where complete 

CC maturation occurs (O-glycosylated and sulfated). After alpha-

CC secretase cleavage, soluble APP is released into the extracellular

CC space and the C-terminal is internalized to endosomes and 

CC lysosomes. Some APP accumulates in secretory transport vesicles 

CC leaving the late Golgi compartment and returns to the cell 

CC surface. Gamma-CTF(59) peptide is located to both the cytoplasm

CC and nuclei of neurons (By similarity).

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Comment=Additional isoforms seem to exist; 

CC Name=APP770; 

CC IsoId=P53601-1; Sequence=Displayed;

CC Name=APP695; 

CC IsoId=P53601-2; Sequence=VSP_000010, VSP_000011; 

CC -!- DOMAIN: The basolateral sorting signal (BaSS) is required for 

CC sorting of membrane proteins to the basolateral surface of 

CC epithelial cells (By similarity).

CC -!- DOMAIN: The NPXY sequence motif found in many tyrosine- 

CC phosphorylated proteins is required for the specific binding of 

CC the PID domain. However additional amino acids either N- or C- 

CC terminal to the NPXY motif are often required for complete 

CC interaction. The PID domain-containing proteins which bind APP 

CC require the YENPTY motif for full interaction. These interactions 

CC are independent of phosphorylation on the terminal tyrosine 

CC residue. The NPXY site is also involved in clathrin-mediated 

CC endocytosis (By similarity).

CC -!- PTM: Proteolytically processed under normal cellular conditions. 

CC Cleavage by alpha-secretase or alternatively by beta-secretase 

CC leads to generation and extracellular release of soluble APP 

CC peptides, S-APP-alpha and S-APP-beta, respectively, and the 

CC retention of corresponding membrane-anchored C-terminal fragments, 

CC C83 and C99. Subsequent processing of C83 by gamma-secretase 

CC yields P3 peptides. This is the major secretory pathway and is 

CC nonamyloidogenic. Alternatively, presenilin/nicastrin-mediated 

CC gamma-secretase processing of C99 releases the amyloid beta 

CC proteins, amyloid-beta 40 (Abeta40) and amyloid-beta 42 (Abeta42), 

CC major components of amyloid plaques, and the cytotoxic C-terminal 

CC fragments, gamma-CTF(50), gamma-CTF(57) and gamma-CTF(59) (By

CC similarity).

CC -!- PTM: Proteolytically cleaved by caspases during neuronal apoptosis 

CC (By similarity). Cleavage at Asp-739 by either caspase-3, -8 or -9

CC results in the production of the neurotoxic C31 peptide and the

CC increased production of beta-amyloid peptides (By similarity).

CC -!- PTM: N- and O-glycosylated (By similarity). 

CC -!- PTM: Phosphorylation in the C-terminal on tyrosine, threonine and 

CC serine residues is neuron-specific. Phosphorylation can affect APP 

CC processing, neuronal differentiation and interaction with other 

CC proteins (By similarity).

CC -!- MISCELLANEOUS: Chelation of metal ions, notably copper, iron and 



CC zinc, can induce histidine-bridging between beta-amyloid molecules 

CC resulting in beta-amyloid-metal aggregates (By similarity).

CC Extracellular zinc-binding increases binding of heparin to APP and

CC inhibits collagen-binding (By similarity).

CC -!- SIMILARITY: Belongs to the APP family. 

CC -!- SIMILARITY: Contains 1 BPTI/Kunitz inhibitor domain.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; M58727; AAA36829.1; -. 

DR EMBL; M58726; AAA36828.1; -. 

DR HSSP; P05067; 1AAP. 

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR008154; A4_extra. 

DR InterPro; IPR001255; Beta-APP. 

DR InterPro; IPR002223; Kunitz_BPTI. 

DR Pfam; PF02177; A4_EXTRA; 1. 

DR Pfam; PF03494; Beta-APP; 1. 

DR Pfam; PF00014; Kunitz_BPTI; 1. 

DR PRINTS; PR00203; AMYLOIDA4.

DR PRINTS; PR00759; BASICPTASE. 

DR ProDom; PD000222; Kunitz_BPTI; 1. 

DR SMART; SM00006; A4_EXTRA; 1. 

DR SMART; SM00131; KU; 1. 

DR PROSITE; PS00319; A4_EXTRA; 1. 

DR PROSITE; PS00320; A4_INTRA; 1. 

DR PROSITE; PS00280; BPTI_KUNITZ_1; 1.

DR PROSITE; PS50279; BPTI_KUNITZ_2; 1.

KW Apoptosis; Endocytosis; Cell adhesion; Serine protease inhibitor; 

KW Coated pits; Neurone; Heparin-binding; Metal-binding; Copper; Iron; 

KW Zinc; Signal; Transmembrane; Glycoprotein; Phosphorylation; 

KW Proteoglycan; Alternative splicing; Amyloid. 
RESULT 10
A4_MOUSE 

ID A4_MOUSE STANDARD; PRT; 770 AA.

AC P12023; P97487; P97942; Q99K32; 

DT 01-OCT-1989 (Rel. 12, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Amyloid beta A4 protein precursor (APP) (ABPP) (Alzheimer's disease 

DE amyloid protein homolog) (Amyloidogenic glycoprotein) (AG) [Contains: 

DE Soluble APP-alpha (S-APP-alpha); Soluble APP-beta (S-APP-beta); C99

DE (APP-C99); Beta-amyloid protein 42 (Beta-APP42); Beta-amyloid protein

DE 40 (Beta-APP40); C83; P3(42); P3(40); Gamma-CTF(59) (Gamma-secretase

DE C-terminal fragment 59) (Amyloid intracellular domain 59) (AID(59))

DE (APP-C59); Gamma-CTF(57) (Gamma-secretase C-terminal fragment 57)

DE (Amyloid intracellular domain 57) (AID(57)) (APP-C57); Gamma-CTF(50)

DE (Gamma-secretase C-terminal fragment 50) (Amyloid intracellular domain

DE 50) (AID(50)); C31].

GN APP. 



OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM APP695).

RC TISSUE=Brain;

RX MEDLINE=88106489; PubMed=3322280;

RA Yamada T., Sasaki H., Furuya H., Miyata T., Goto I., Sakaki Y.;

RT "Complementary DNA for the mouse homolog of the human amyloid beta

RT protein precursor.";

RL Biochem. Biophys. Res. Commun. 149:665-671(1987).

RN [2] 

RP REVISIONS. 

RA Yamada T.;

RL Submitted (MAR-1988) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM APP695). 

RC STRAIN=BALB/c; TISSUE=Brain;

RX MEDLINE=92096458; PubMed=1756177;

RA de Strooper B., van Leuven F., van den Berghe H.;

RT "The amyloid beta protein precursor or proteinase nexin II from mouse 

RT is closer related to its human homolog than previously reported."; 

RL Biochim. Biophys. Acta 1129:141-143(1991). 

RN [4] 

RP SEQUENCE FROM N.A. (ISOFORM APP695).

RC STRAIN=SAMP8; TISSUE=Hippocampus;

RX MEDLINE=21130647; PubMed=11235921;

RA Kumar V.B., Vyas K., Franko M., Choudhary V., Buddhiraju C.,

RA Alvarez J., Morley J.E.; 

RT "Molecular cloning, expression, and regulation of hippocampal amyloid 

RT precursor protein of senescence accelerated mouse (SAMP8)."; 

RL Biochem. Cell Biol. 79:57-67(2001). 

RN [5] 

RP SEQUENCE OF 1-19 FROM N.A. 

RX MEDLINE=92209998; PubMed=1555768;

RA Izumi R., Yamada T., Yoshikai S.I., Sasaki H., Hattori M.,

RA Sakai Y.;

RT "Positive and negative regulatory elements for the expression of the 

RT Alzheimer's disease amyloid precursor-encoding gene in mouse."; 

RL Gene 112:189-195(1992). 

RN [6]

RP PARTIAL SEQUENCE FROM N.A. (ISOFORM APP770).

RC TISSUE=Breast tumor;

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,



RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [7] 

RP SEQUENCE OF 281-380 FROM N.A., AND ALTERNATIVE SPLICING. 

RC TISSUE=Brain, and Kidney; 

RX MEDLINE=89149813; PubMed=2493250;

RA Yamada T., Sasaki H., Dohura K., Goto I., Sakaki Y.; 

RT "Structure and expression of the alternatively-spliced forms of mRNA 

RT for the mouse homolog of Alzheimer's disease amyloid beta protein 

RT precursor.";

RL Biochem. Biophys. Res. Commun. 158:906-912(1989).

RN [8] 

RP SEQUENCE OF 289-364 FROM N.A. 

RC STRAIN=CD-1; TISSUE=Placenta;

RX MEDLINE=89345111; PubMed=2569710;

RA Fukuchi K., Martin G.M., Deeb S.S.; 

RT "Sequence of the protease inhibitor domain of the A4 amyloid protein 

RT precursor of Mus domesticus.";

RL Nucleic Acids Res. 17:5396-5396(1989). 

RN [9] 

RP SEQUENCE OF 656-737 FROM N.A. 

RC STRAIN=129/Sv; 

RA Wragg M.A., Busfield F., Duff K., Korenblat K., Capecchi M.,

RA Loring J.F., Goate A.M.; 

RT "Introduction of six mutations into the mouse genome using 'Hit and

RT Run' gene-targeting: introduction of familial Alzheimer's disease

RT mutations into the mouse amyloid precursor protein gene and 

RT humanization of the A-beta fragment."; 

RL Submitted (DEC-1996) to the EMBL/GenBank/DDBJ databases.

RN [10] 

RP TISSUE SPECIFICITY OF ALTERNATIVE SPLICED FORMS. 

RX MEDLINE=93287808; PubMed=8510506;

RA Sola C., Mengod G., Ghetti B., Palacios J.M., Triarhou L.C.;

RT "Regional distribution of the alternatively spliced isoforms of beta 

RT APP RNA transcript in the brain of normal, heterozygous and 

RT homozygous weaver mutant mice as revealed by in situ hybridization 

RT histochemistry."; 

RL Brain Res. Mol. Brain Res. 17:340-346(1993).

RN [11] 

RP INTERACTION WITH KNS2. 

RX MEDLINE=21010507; PubMed=11144355;

RA Kamal A., Stokin G.B., Yang Z., Xia C.-H., Goldstein L.S.; 

RT "Axonal transport of amyloid precursor protein is mediated by direct 

RT binding to the kinesin light chain subunit of kinesin-I."; 

RL Neuron 28:449-459(2000).

RN [12] 

RP C-TERMINAL PROTEIN-PROTEIN INTERACTIONS, AND MUTAGENESIS OF TYR-728; 

RP THR-743; TYR-757; ASN-759 AND TYR-762. 

RX MEDLINE=21408156; PubMed=11517249;

RA Matsuda S., Yasukawa T., Homma Y., Ito Y., Niikura T., Hiraki T.,

RA Hirai S., Ohno S., Kita Y., Kawasumi M., Kouyama K., Yamamoto T.,



RA Kyriakis J.M., Nishimoto I.; 

RT "C-jun N-terminal kinase (JNK)-interacting protein-1b/islet-brain-1

RT scaffolds Alzheimer's amyloid precursor protein with JNK."; 

RL J. Neurosci. 21:6597-6607(2001). 

RN [13] 

RP INTERACTION WITH MAPK8IP1, AND PHOSPHORYLATION. 

RX MEDLINE=22028091; PubMed=11912189;

RA Taru H., Iijima K.-I., Hase M., Kirino Y., Yagi Y., Suzuki T.;

RT "Interaction of Alzheimer's beta-amyloid precursor family proteins 

RT with scaffold proteins of the JNK signaling cascade."; 

RL J. Biol. Chem. 277:20070-20078(2002).

RN [14] 

RP INTERACTION OF CTF PEPTIDES WITH NUMB. 

RX MEDLINE=22008109; PubMed=12011466;

RA Roncarati R., Sestan N., Scheinfeld M.H., Berechid B.E., Lopez P.A.,

RA Meucci O., McGlade J.C., Rakic P., D'Adamio L.;

RT "The gamma-secretase-generated intracellular domain of beta-amyloid 

RT precursor protein binds Numb and inhibits Notch signaling."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:7102-7107(2002).

RN [15] 

RP GAMMA-SECRETASE PROCESSING, AND INTERACTION WITH APBB1.

RX MEDLINE=21437805; PubMed=11553691;

RA Cupers P., Orlans I., Craessaerts K., Annaert W., De Strooper B.;

RT "The amyloid precursor protein (APP)-cytoplasmic fragment generated by

RT gamma-secretase is rapidly degraded but distributes partially in a

RT nuclear fraction of neurones in culture.";

RL J. Neurochem. 78:1168-1178(2001). 

CC -!- FUNCTION: Functions as a cell surface receptor and performs 

CC physiological functions on the surface of neurons relevant to 

CC neurite growth, neuronal adhesion and axonogenesis. Involved in

CC cell mobility and transcription regulation through protein-protein 

CC interactions. Can promote transcription activation through binding 

CC to APBB1/Tip60 and inhibit Notch signaling through interaction 

CC with Numb. Couples to apoptosis-inducing pathways such as those 

CC mediated by G(O) and JIP. Inhibits G(O) alpha ATPase activity (By

CC similarity). Acts as a kinesin I membrane receptor, mediating the

CC axonal transport of beta-secretase and presenilin 1. May be 

CC involved in copper homeostasis/oxidative stress through copper ion 

CC reduction. Can regulate neurite outgrowth through binding to 

CC components of the extracellular matrix such as heparin and 

CC collagen I and IV (By similarity). The splice isoforms that

CC contain the BPTI domain possess protease inhibitor activity (By

CC similarity).

CC -!- FUNCTION: Beta-amyloid peptides are lipophilic metal chelators 

CC with metal-reducing activity. Bind transient metals such as 

CC copper, zinc and iron. Rat and mouse beta-amyloid peptides bind 

CC only weakly transient metals and have little reducing activity due 

CC to substitutions of transient metal chelating residues. Beta-APP42 

CC may activate mononuclear phagocytes in the brain and elicit 

CC inflammatory responses. Promotes both tau aggregation and TPK II-

CC mediated phosphorylation (By similarity).

CC -!- FUNCTION: The gamma-CTF peptides as well as the caspase-cleaved 
CC peptides, including C31, are potent enhancers of neuronal 

CC apoptosis.

CC -!- SUBUNIT: Binds, via its C-terminal, to the PID domain of several 
CC cytoplasmic proteins, including APBB family members, the APBA 

CC family, MAPK8IP1, SHC1, Numb and Dab1. Binding to Dab1 inhibits



CC its serine phosphorylation. Also interacts with GPCR-like protein 

CC BPP, FPRL1, APPBP1, IB1, KNS2 (via its TPR domains), APPBP2 (via 

CC BaSS) and DDB1 (By similarity). In vitro, it binds MAPT via the

CC MT-binding domains (By similarity). Associates with microtubules

CC in the presence of ATP and in a kinesin-dependent manner (By

CC similarity). Interacts, through a C-terminal domain, with GNAO1

CC (By similarity). Amyloid beta-42 binds CHRNA7 in hippocampal

CC neurons (By similarity). Beta-amyloid associates with HADH2 (By

CC similarity).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. Cell surface 

CC protein that rapidly becomes internalized via clathrin-coated 

CC pits. During maturation, the immature APP (N-glycosylated in the 

CC endoplasmic reticulum) moves to the Golgi complex where complete 

Query Match 85.4%; Score 35; DB 1; Length 770; 

Best Local Similarity 100.0%; Pred. No. 9.8; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
|||||||

Db 688 LVFFAED 694 



RESULT 11 
A4_PIG 

ID A4_PIG STANDARD; PRT; 770 AA. 

AC P79307; Q29023; Q9TUI0; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update)

DE Amyloid beta A4 protein precursor (APP) (ABPP) (Alzheimer's disease

DE amyloid protein homolog) [Contains: Soluble APP-alpha (S-APP-alpha);

DE Soluble APP-beta (S-APP-beta); C99; Beta-amyloid protein 42 (Beta-

DE APP42); Beta-amyloid protein 40 (Beta-APP40); C83; P3(42); P3(40);

DE Gamma-CTF(59) (Gamma-secretase C-terminal fragment 59); Gamma-CTF(57)

DE (Gamma-secretase C-terminal fragment 57); Gamma-CTF(50) (Gamma-

DE secretase C-terminal fragment 50); C31].

OS Sus scrofa (Pig).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.

OX NCBI_TaxID=9823;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Kimura A., Takahashi T.; 

RT "Amyloid precursor protein 770."; 

RL Submitted (SEP-1999) to the EMBL/GenBank/DDBJ databases.

RN [2] 

RP SEQUENCE OF 1-136 FROM N.A. 

RC TISSUE=Small intestine;

RA Winteroe A.K., Fredholm M.;

RT "Evaluation and characterization of a porcine small intestine cDNA 

RT library."; 

RL Submitted (JAN-1997) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP SEQUENCE OF 667-723 FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=92017079; PubMed=1656157;



RA Johnstone E.M., Chaney M.O., Norris F.H., Pascual R., Little S.P.;

RT "Conservation of the sequence of the Alzheimer's disease amyloid 

RT peptide in dog, polar bear and five other mammals by cross-species 

RT polymerase chain reaction analysis."; 

RL Brain Res. Mol. Brain Res. 10:299-305(1991).

CC -!- FUNCTION: Functions as a cell surface receptor and performs 
CC physiological functions on the surface of neurons relevant to 

CC neurite growth, neuronal adhesion and axonogenesis. Involved in

CC cell mobility and transcription regulation through protein-protein

CC interactions (By similarity). Can promote transcription activation

CC through binding to APBB1/Tip60 and inhibit Notch signaling through

CC interaction with Numb (By similarity). Couples to apoptosis-

CC inducing pathways such as those mediated by G(O) and JIP (By

CC similarity). Inhibits G(O) alpha ATPase activity (By similarity).

CC Acts as a kinesin I membrane receptor, mediating the axonal

CC transport of beta-secretase and presenilin 1 (By similarity). May

CC be involved in copper homeostasis/oxidative stress through copper

CC ion reduction (By similarity). In vitro, copper-metallated APP

CC induces neuronal death directly or is potentiated through Cu(II)-

CC mediated low-density lipoprotein oxidation (By similarity). Can

CC regulate neurite outgrowth through binding to components of the

CC extracellular matrix such as heparin and collagen I and IV (By

CC similarity).

CC -!- FUNCTION: Beta-amyloid peptides are lipophilic metal chelators 
CC with metal-reducing activity. Bind transient metals such as 

CC copper, zinc and iron (By similarity).

CC -!- FUNCTION: The gamma-CTF peptides as well as the caspase-cleaved 
CC peptides, including C31, are potent enhancers of neuronal 

CC apoptosis (By similarity).

CC -!- SUBUNIT: Binds, via its C-terminal, to the PID domain of several 

CC cytoplasmic proteins, including APBB family members, the APBA 

CC family, MAPK8IP1, and SHC1, Numb and Dab1 (By similarity). Binding

CC to Dab1 inhibits its serine phosphorylation (By similarity). Also

CC interacts with GPCR-like protein BPP, FPRL1, APPBP1, IB1, KNS2

CC (via its TPR domains) (By similarity), APPBP2 (via BaSS) and DDB1.

CC In vitro, it binds MAPT via the MT-binding domains (By

CC similarity). Associates with microtubules in the presence of ATP

CC and in a kinesin-dependent manner (By similarity).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. Cell surface 

CC protein that rapidly becomes internalized via clathrin-coated 

CC pits. During maturation, the immature APP (N-glycosylated in the 

CC endoplasmic reticulum) moves to the Golgi complex where complete 

CC maturation occurs (O-glycosylated and sulfated). After alpha-

CC secretase cleavage, soluble APP is released into the extracellular

CC space and the C-terminal is internalized to endosomes and 

CC lysosomes. Some APP accumulates in secretory transport vesicles, 

CC leaving the late Golgi compartment and returns to the cell 

CC surface. Gamma-CTF(59) peptide is located to both the cytoplasm

CC and nuclei of neurons (By similarity).

CC -!- DOMAIN: The basolateral sorting signal (BaSS) is required for 
CC sorting of membrane proteins to the basolateral surface of 

CC epithelial cells (By similarity).

CC -!- DOMAIN: The NPXY sequence motif found in many tyrosine- 

CC phosphorylated proteins is required for the specific binding of 

CC the PID domain. However additional amino acids either N- or C- 

CC terminal to the NPXY motif are often required for complete 

CC interaction. The PID domain-containing proteins which bind APP 



CC require the YENPTY motif for full interaction. These interactions 

CC are independent of phosphorylation on the terminal tyrosine 

CC residue. The NPXY site is also involved in clathrin-mediated 

CC endocytosis (By similarity).

CC -!- PTM: Proteolytically processed under normal cellular conditions. 

CC Cleavage by alpha-secretase or alternatively by beta-secretase 

CC leads to generation and extracellular release of soluble APP 

CC peptides, S-APP-alpha and S-APP-beta, respectively, and the 

CC retention of corresponding membrane-anchored C-terminal fragments, 

CC C83 and C99. Subsequent processing of C83 by gamma-secretase 

CC yields P3 peptides. This is the major secretory pathway and is 

CC nonamyloidogenic. Alternatively, presenilin/nicastrin-mediated

CC gamma-secretase processing of C99 releases the amyloid beta 

CC proteins, amyloid-beta 40 (Abeta40) and amyloid-beta 42 (Abeta42), 

CC major components of amyloid plaques, and the cytotoxic C-terminal 

CC fragments, gamma-CTF(50), gamma-CTF(57) and gamma-CTF(59) (By

CC similarity).

CC -!- PTM: Proteolytically cleaved by caspases during neuronal apoptosis 
CC (By similarity). Cleavage at Asp-739 by either caspase-3, -8 or -9

CC results in the production of the neurotoxic C31 peptide and the

CC increased production of beta-amyloid peptides (By similarity).

CC -!- PTM: N- and O-glycosylated (By similarity). 

CC -!- PTM: Phosphorylation in the C-terminal on tyrosine, threonine and 
CC serine residues is neuron-specific. Phosphorylation can affect APP 

CC processing, neuronal differentiation and interaction with other 

CC proteins (By similarity).

CC -!- PTM: Extracellular binding and reduction of copper, results in a 
CC corresponding oxidation of Cys-144 and Cys-158, and the formation 

CC of a disulfide bond (By similarity).

CC -!- MISCELLANEOUS: Chelation of metal ions, notably copper, iron and 

CC zinc, can induce histidine-bridging between beta-amyloid molecules 

CC resulting in beta-amyloid-metal aggregates (By similarity).

CC Extracellular zinc-binding increases binding of heparin to APP and

CC inhibits collagen-binding (By similarity).

CC -!- SIMILARITY: Belongs to the APP family. 

CC -!- SIMILARITY: Contains 1 BPTI/Kunitz inhibitor domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AB032550; BAA84580.1; -.

DR EMBL; Z84022; CAB06313.1; -. 

DR EMBL; X56127; CAA39592.1; -. 

DR HSSP; P05067; 1AAP.

DR InterPro; IPR008155; A4_APP.

DR InterPro; IPR008154; A4_extra.

DR InterPro; IPR002223; Kunitz_BPTI. 

DR Pfam; PF02177; A4_EXTRA; 1. 

DR PRINTS; PR00203; AMYLOIDA4.

DR PRINTS; PR00759; BASICPTASE. 

DR ProDom; PD000222; Kunitz_BPTI; 1. 

DR SMART; SM00006; A4_EXTRA; 1.

RESULT 12
A4_RAT 

ID A4_RAT STANDARD; PRT; 770 AA. 

AC P08592; 

DT 01-AUG-1988 (Rel. 08, Created) 

DT 01-DEC-1992 (Rel. 24, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Amyloid beta A4 protein precursor (APP) (Alzheimer's disease amyloid 

DE protein homolog) (Amyloidogenic glycoprotein) (AG) [Contains: Soluble 

DE APP-alpha (S-APP-alpha); Soluble APP-beta (S-APP-beta); C99; Beta-

DE amyloid protein 42 (Beta-APP42); Beta-amyloid protein 40 (Beta-APP40);

DE C83; P3(42); P3(40); Gamma-CTF(59) (Gamma-secretase C-terminal

DE fragment 59); Gamma-CTF(57) (Gamma-secretase C-terminal fragment 57);

DE Gamma-CTF(50) (Gamma-secretase C-terminal fragment 50); C31].

GN APP. 

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM APP695).

RC TISSUE=Brain;

RX MEDLINE=88312583; PubMed=2900758;

RA Shivers B.D., Hilbich C., Multhaup G., Salbaum J.M., Beyreuther K.,

RA Seeburg P.H.;

RT "Alzheimer's disease amyloidogenic glycoprotein: expression pattern 

RT in rat brain suggests a role in cell contact."; 

RL EMBO J. 7:1365-1370(1988). 

RN [2] 

RP SEQUENCE OF 289-364 FROM N.A. 

RC TISSUE=Liver; 

RX MEDLINE=89183625; PubMed=2648331;

RA Kang J., Mueller-Hill B.; 

RT "The sequence of the two extra exons in rat preA4."; 

RL Nucleic Acids Res. 17:2130-2130(1989). 

RN [3] 

RP SEQUENCE OF 720-730, AND MASS SPECTROMETRY. 

RX MEDLINE=21443797; PubMed=11483588;

RA Gu Y., Misonou H., Sato T., Dohmae N., Takio K., Ihara Y.;

RT "Distinct intramembrane cleavage of the beta-amyloid precursor protein 

RT family resembling gamma-secretase-like cleavage of Notch."; 

RL J. Biol. Chem. 276:35235-35238(2001).

RN [4] 

RP ALTERNATIVE SPLICING. 

RX MEDLINE=96187032; PubMed=8624099;

RA Sandbrink R., Masters C.L., Beyreuther K.;

RT "APP gene family. Alternative splicing generates functionally related

RT isoforms.";

RL Ann. N.Y. Acad. Sci. 777:281-287(1996).

RN [5] 



RP TISSUE SPECIFICITY OF APPICAN. 

RX MEDLINE=95263526; PubMed=7744833;

RA Shioi J., Pangalos M.N., Ripellino J.A., Vassilacopoulou D.,

RA Mytilineou C., Margolis R.U., Robakis N.K.;

RT "The Alzheimer amyloid precursor proteoglycan (appican) is present in 

RT brain and is produced by astrocytes but not by neurons in primary 

RT neural cultures."; 

RL J. Biol. Chem. 270:11839-11844(1995).

RN [6] 

RP TISSUE SPECIFICITY OF ISOFORMS. 

RX MEDLINE=97150061; PubMed=8996834;

RA Sandbrink R., Monning U., Masters C.L., Beyreuther K.;

RT "Expression of the APP gene family in brain cells, brain development 

RT and aging."; 

RL Gerontology 43:119-131(1997). 

RN [7] 

RP INTERACTION WITH DDB1, AND MUTAGENESIS OF TYR-757; ASN-759 AND 

RP TYR-762. 

RX MEDLINE=99127916; PubMed=9930726;

RA Watanabe T., Sukegawa J., Tomita S., Iijima K.-I., Oguchi S.,

RA Suzuki T., Nairn A.C., Greengard P.;

RT "A 127-kDa protein (UV-DDB) binds to the cytoplasmic domain of the 

RT Alzheimer's amyloid precursor protein."; 

RL J. Neurochem. 72:549-556(1999).

RN [8] 

RP INTERACTION WITH GNAO1, AND MUTAGENESIS OF 732-HIS-HIS-733.

RX MEDLINE=99162676; PubMed=10024358;

RA Brouillet E., Trembleau A., Galanaud D., Volovitch M., Bouillot C.,

RA Valenza C., Prochiantz A., Allinquant B.;

RT "The amyloid precursor protein interacts with Go heterotrimeric 

RT protein within a cell compartment specialized in signal 

RT transduction.";

RL J. Neurosci. 19:1717-1727(1999). 

RN [9] 

RP CHARACTERISTICS OF APPICAN, AND MUTAGENESIS OF SER-656. 

RX MEDLINE=95256193; PubMed=7737970;

RA Pangalos M.N., Efthimiopoulos S., Shioi J., Robakis N.K.;

RT "The chondroitin sulfate attachment site of appican is formed by 

RT splicing out exon 15 of the amyloid precursor gene."; 

RL J. Biol. Chem. 270:10388-10391(1995). 

RN [10] 

RP BETA-AMYLOID METAL-BINDING.

RX MEDLINE=99316162; PubMed=10386999;

RA Huang X., Atwood C.S., Hartshorn M.A., Multhaup G., Goldstein L.E.,

RA Scarpa R.C., Cuajungco M.P., Gray D.N., Lim J., Moir R.D., Tanzi R.E.,

RA Bush A.I.;

RT "The A beta peptide of Alzheimer's disease directly produces hydrogen 

RT peroxide through metal ion reduction."; 

RL Biochemistry 38:7609-7616(1999).

RN [11] 

RP BETA-AMYLOID ZINC BINDING. 

RX MEDLINE=99343552; PubMed=10413512;

RA Liu S.T., Howlett G., Barrow C.J.; 

RT "Histidine-13 is a crucial residue in the zinc ion-induced aggregation 

RT of the A beta peptide of Alzheimer's disease."; 

RL Biochemistry 38:9373-9378(1999). 

RN [12] 



RP IMPORTANCE OF GLY-704 IN FREE RADICAL PROPAGATION, AND MUTAGENESIS OF 

RP GLY-704.

RX MEDLINE=21956095; PubMed=11959460;

RA Kanski J., Varadarajan S., Aksenova M., Butterfield D.A.;

RT "Role of glycine-33 and methionine-35 in Alzheimer's amyloid beta- 

RT peptide 1-42-associated oxidative stress and neurotoxicity."; 

RL Biochim. Biophys. Acta 1586:190-198(2001).

RN [13] 

RP PHOSPHORYLATION. 

RX MEDLINE=97239592; PubMed=9085254;

RA Oishi M., Nairn A.C., Czernik A.J., Lim G.S., Isohara T., Gandy S.E.,

RA Greengard P., Suzuki T.;

RT "The cytoplasmic domain of Alzheimer's amyloid precursor protein is

RT phosphorylated at Thr654, Ser655, and Thr668 in adult rat brain and 

RT cultured cells."; 

RL Mol. Med. 3:111-123(1997). 

RN [14] 

RP PHOSPHORYLATION ON SER-730. 

RX MEDLINE=99262094; PubMed=10329382;

RA Isohara T., Horiuchi A., Watanabe T., Ando K., Czernik A.J., Uno I.,

RA Greengard P., Nairn A.C., Suzuki T.;

RT "Phosphorylation of the cytoplasmic domain of Alzheimer's beta-amyloid 

RT precursor protein at Ser655 by a novel protein kinase."; 

RL Biochem. Biophys. Res. Commun. 258:300-305(1999). 

RN [15] 

RP PHOSPHORYLATION, INDUCTION, SUBCELLULAR LOCATION, AND MUTAGENESIS OF 

RP THR-743. 

RX MEDLINE=99274744; PubMed=10341243;

RA Ando K., Oishi M., Takeda S., Iijima K.-I., Isohara T., Nairn A.C.,

RA Kirino Y., Greengard P., Suzuki T.; 

RT "Role of phosphorylation of Alzheimer's amyloid precursor protein 

RT during neuronal differentiation."; 

RL J. Neurosci. 19:4421-4427(1999). 

RN [16] 

RP PHOSPHORYLATION ON THR-743. 

RX MEDLINE=20396183; PubMed=10936190;

RA Iijima K.-I., Ando K., Takeda S., Satoh Y., Seki T., Itohara S., 

RA Greengard P., Kirino Y., Nairn A.C., Suzuki T.; 

RT "Neuron-specific phosphorylation of Alzheimer's beta-amyloid precursor 

RT protein by cyclin-dependent kinase 5."; 

RL J. Neurochem. 75:1085-1091(2000). 

RN [17] 

RP CARBOHYDRATE STRUCTURE OF APPICAN. 

RX MEDLINE=21463085; PubMed=11479316;

RA Tsuchida K., Shioi J., Yamada S., Boghosian G., Wu A., Cai H., 

RA Sugahara K., Robakis N.K.; 

RT "Appican, the proteoglycan form of the amyloid precursor protein, 

RT contains chondroitin sulfate E in the repeating disaccharide region 

RT and 4-O-sulfated galactose in the linkage region."; 

RL J. Biol. Chem. 276:37155-37160(2001). 

CC -!- FUNCTION: Functions as a cell surface receptor and performs 
CC physiological functions on the surface of neurons relevant to 

CC neurite growth, neuronal adhesion and axonogenesis. Involved in

CC cell mobility and transcription regulation through protein-protein

CC interactions (By similarity). Can promote transcription activation

CC through binding to APBB1/Tip60 and inhibit Notch signaling through

CC interaction with Numb (By similarity). Couples to apoptosis-

CC inducing pathways such as those mediated by G(O) and JIP. Inhibits

CC G(O) alpha ATPase activity. Acts as a kinesin I membrane receptor,

CC mediating the axonal transport of beta-secretase and presenilin 1 

CC (By similarity). May be involved in copper homeostasis/oxidative 

CC stress through copper ion reduction. Can regulate neurite 

CC outgrowth through binding to components of the extracellular 

CC matrix such as heparin and collagen I and IV (By similarity). The

CC splice isoforms that contain the BPTI domain possess protease

CC inhibitor activity (By similarity).

CC -!- FUNCTION: Beta-amyloid peptides are lipophilic metal chelators 

CC with metal-reducing activity. Bind transient metals such as 

CC copper, zinc and iron. Rat and mouse beta-amyloid peptides bind 

CC only weakly transient metals and have little reducing activity due 

CC to substitutions of transient metal chelating residues. Beta-APP42 

CC may activate mononuclear phagocytes in the brain and elicit 

CC inflammatory responses. Promotes both tau aggregation and TPK II-

CC mediated phosphorylation (By similarity).

CC -!- FUNCTION: Appicans elicit adhesion of neural cells to the 

CC extracellular matrix and may regulate neurite outgrowth in the 

CC brain. 

CC -!- FUNCTION: The gamma-CTF peptides as well as the caspase-cleaved 

CC peptides, including C31, are potent enhancers of neuronal 

CC apoptosis (By similarity).

CC -!- SUBUNIT: Binds, via its C-terminal, to the PID domain of several

CC cytoplasmic proteins, including APBB family members, the APBA

CC family, MAPK8IP1, SHC1 and Numb and Dab1 (By similarity). Binding

CC to Dab1 inhibits its serine phosphorylation (By similarity). Also

CC interacts with GPCR-like protein BPP, FPRL1, APPBP1, IB1, KNS2

CC (via its TPR domains), APPBP2 (via BaSS) (By similarity) and DDB1.

CC In vitro, it binds MAPT via the MT-binding domains (By

CC similarity). Associates with microtubules in the presence of ATP

CC and in a kinesin-dependent manner (By similarity). Interacts,

CC through a C-terminal domain, with GNAO1. Amyloid beta-42 binds

CC CHRNA7 in hippocampal neurons (By similarity). Beta-amyloid

CC associates with HADH2 (By similarity).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. Cell surface 

CC protein that rapidly becomes internalized via clathrin-coated 

CC pits. During maturation, the immature APP (N-glycosylated in the 



Query Match 85.4%; Score 35; DB 1; Length 770; 

Best Local Similarity 100.0%; Pred. No. 9.8; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy          1 LVFFAED 7
              |||||||
Db        688 LVFFAED 694



RESULT 13 
A4_TETFL 

ID A4_TETFL STANDARD; PRT; 780 AA. 

AC O73683;

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Alzheimer's disease amyloid A4 protein homolog precursor [Contains: 
DE Beta-amyloid protein (Beta-APP) (A-beta)].



GN APP. 

OS Tetraodon fluviatilis (Puffer fish).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;

OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes;

OC Tetradontoidea; Tetraodontidae; Tetraodon. 

OX NCBI_TaxID=47145; 

RN [1] 

RP SEQUENCE FROM N.A.

RX MEDLINE=98252138; PubMed=9599080;

RA Villard L., Tassone F., Crnogorac-Jurcevic T., Clancy K., Gardiner K.;

RT "Analysis of pufferfish homologues of the AT-rich human APP gene.";

RL Gene 210:17-24(1998). 

CC -!- FUNCTION: Functional neuronal receptor which couples to 

CC intracellular signaling pathway through the GTP-binding protein 

CC G(O) (By similarity). 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- SIMILARITY: Belongs to the APP family. 

CC -!- SIMILARITY: Contains 1 BPTI/Kunitz inhibitor domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

DR EMBL; AF018165; AAC41275.1; -. 

DR HSSP; P05067; 1HZ3.

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR008154; A4_extra. 

DR InterPro; IPR001255; Beta-APP. 

DR InterPro; IPR002223; Kunitz_BPTI. 

DR Pfam; PF02177; A4_EXTRA; 1. 

DR Pfam; PF03494; Beta-APP; 1. 

DR Pfam; PF00014; Kunitz_BPTI; 1. 

DR PRINTS; PR00203; AMYLOIDA4.

DR PRINTS; PR00759; BASICPTASE. 

DR ProDom; PD000222; Kunitz_BPTI; 1. 

DR SMART; SM00006; A4_EXTRA; 1. 

DR SMART; SM00131; KU; 1. 

DR PROSITE; PS00319; A4_EXTRA; 1. 

DR PROSITE; PS00320; A4_INTRA; 1. 

DR PROSITE; PS00280; BPTI_KUNITZ_1; FALSE_NEG.

DR PROSITE; PS50279; BPTI_KUNITZ_2; 1.

KW Glycoprotein; Amyloid; Neurone; Transmembrane; Signal; 

KW Serine protease inhibitor. 

FT SIGNAL 1 18 POTENTIAL.
FT CHAIN 19 780 ALZHEIMER'S DISEASE AMYLOID A4 PROTEIN
FT HOMOLOG.
FT CHAIN 682 724 BETA-AMYLOID PROTEIN (POTENTIAL).
FT DOMAIN 19 711 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 712 732 POTENTIAL.
FT DOMAIN 733 780 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 323 382 BPTI/KUNITZ INHIBITOR.
FT SITE 769 772 CLATHRIN-BINDING (BY SIMILARITY).
FT DISULFID 327 378 BY SIMILARITY.
FT DISULFID 336 361 BY SIMILARITY.
FT CARBOHYD 560 560 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 780 AA; 88238 MW; 60071BE94520191D CRC64;



Query Match 85.4%; Score 35; DB 1; Length 780;

Best Local Similarity 100.0%; Pred. No. 10;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 LVFFAED 7
              |||||||
Db        698 LVFFAED 704



RESULT 14 
PE23_SHEEP

ID PE23_SHEEP STANDARD; PRT; 89 AA.

AC Q28550; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Prostaglandin E2 receptor, EP3 subtype (Prostanoid EP3 receptor) (PGE 

DE receptor, EP3 subtype) (Fragment).

GN PTGER3.

OS Ovis aries (Sheep).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Caprinae; Ovis. 

OX NCBI_TaxID=9940; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Kidney outer medulla; 

RX MEDLINE=98287159; PubMed=9625477;

RA Audicana L., Aughey E., O'Shaughnessy P.J.;

RT "Sensitivity of the early luteal phase ovine cervix to prostaglandin

RT E2 (PGE2) and expression of EP3 receptor mRNA.";

RL Res. Vet. Sci. 64:177-179(1998). 

CC -!- FUNCTION: Receptor for prostaglandin E2 (PGE2); the EP3 receptor 

CC may be involved in inhibition of gastric acid secretion, 

CC modulation of neurotransmitter release in central and peripheral 

CC neurons, inhibition of sodium and water reabsorption in kidney 

CC tubulus and contraction in uterine smooth muscle. The activity of 

CC this receptor can couple to both the inhibition of adenylate 

CC cyclase mediated by G(i) proteins, and to an elevation of 

CC intracellular calcium (By similarity).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U37148; AAB81195.1; -. 



DR InterPro; IPR000276; GPCR_Rhodpsn.


UK 


Jr KU o X 1 IIj r 


r j w u zi» -^j i f 


G PROTFTN 


DFfKP yi 1 . FAT.SE NEG 


i\W 


b~procein 




or^ +* r** v • 
I. fcr tJ £. / 


1 I. clIlolllCiLliJ X Ci 1 1 1- / \J-L y UUp IULC1U • 


FT NON_TER 1 1
FT TRANSMEM <1 18 4 (POTENTIAL).
FT DOMAIN 19 48 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 49 74 5 (POTENTIAL).
FT DOMAIN 75 89 CYTOPLASMIC (POTENTIAL).
FT CARBOHYD 38 38 N-LINKED (GLCNAC...) (POTENTIAL).
FT NON_TER 89 89
SQ SEQUENCE 89 AA; 9364 MW; EDAD27E127B0A428 CRC64;



Query Match 78.0%; Score 32; DB 1; Length 89;

Best Local Similarity 85.7%; Pred. No. 5.1;

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              |||| ||
Db         49 VFFASDF 55



RESULT 15 
A4_FUGRU 

ID A4_FUGRU STANDARD; PRT; 737 AA. 

AC O93279;

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Alzheimer's disease amyloid A4 protein homolog precursor [Contains:

DE Beta-amyloid protein (Beta-APP) (A-beta)].

GN APP.

OS Fugu rubripes (Japanese pufferfish) (Takifugu rubripes).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei;

OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes;

OC Tetradontoidea; Tetraodontidae; Takifugu.

OX NCBI_TaxID=31033; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98252138; PubMed=9599080;

RA Villard L., Tassone F., Crnogorac-Jurcevic T., Clancy K., Gardiner K.;

RT "Analysis of pufferfish homologues of the AT-rich human APP gene.";

RL Gene 210:17-24(1998). 

CC -!- FUNCTION: Functional neuronal receptor which couples to 

CC intracellular signaling pathway through the GTP-binding protein 

CC G(O) (By similarity). 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- SIMILARITY: Belongs to the APP family. 

CC -!- SIMILARITY: Contains 1 BPTI/Kunitz inhibitor domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AF090120; AAD13392.1; -. 

DR HSSP; P05067; 1HZ3. 

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR008154; A4_extra. 

DR InterPro; IPR001255; Beta-APP. 

DR InterPro; IPR002223; Kunitz_BPTI.

DR Pfam; PF02177; A4_EXTRA; 1.

DR Pfam; PF03494; Beta-APP; 1.

DR Pfam; PF00014; Kunitz_BPTI; 1.

DR PRINTS; PR00203; AMYLOIDA4.

DR PRINTS; PR00759; BASICPTASE. 

DR ProDom; PD000222; Kunitz_BPTI; 1. 

DR SMART; SM00006; A4_EXTRA; 1. 

DR SMART; SM00131; KU; 1. 

DR PROSITE; PS00319; A4_EXTRA; FALSE_NEG. 

DR PROSITE; PS00320; A4_INTRA; 1.
DR PROSITE; PS00280; BPTI_KUNITZ_1; 1.
DR PROSITE; PS50279; BPTI_KUNITZ_2; 1.
KW Glycoprotein; Amyloid; Neurone; Transmembrane; Signal;
KW Serine protease inhibitor.
FT SIGNAL 1 18 POTENTIAL.
FT CHAIN 19 737 ALZHEIMER'S DISEASE AMYLOID A4
FT PROTEIN HOMOLOG.
FT CHAIN 639 681 BETA-AMYLOID PROTEIN (POTENTIAL).
FT DOMAIN 19 668 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 669 689 POTENTIAL.
FT DOMAIN 690 737 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 286 344 BPTI/KUNITZ INHIBITOR.
FT SITE 726 729 CLATHRIN-BINDING (BY SIMILARITY).
FT ACT_SITE 300 301 REACTIVE BOND.
FT DISULFID 290 340 BY SIMILARITY.
FT DISULFID 299 323 BY SIMILARITY.
FT DISULFID 315 336 BY SIMILARITY.
FT CARBOHYD 522 522 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 737 AA; 82856 MW; 6FAD01E2E3B2B7E2 CRC64;



Query Match 78.0%; Score 32; DB 1; Length 737;

Best Local Similarity 85.7%; Pred. No. 42;

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy          1 LVFFAED 7
              |||||:|
Db        655 LVFFADD 661



RESULT 16 
Y691_CHLTR 

ID Y691_CHLTR STANDARD; PRT; 224 AA. 

AC O84697;

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Hypothetical protein CT691. 

GN CT691. 

OS Chlamydia trachomatis. 

OC Bacteria; Chlamydiae; Chlamydiales; Chlamydiaceae; Chlamydia.



OX NCBI_TaxID=813; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=D/UW-3/Cx;

RX MEDLINE=99000809; PubMed=9784136;

RA Stephens R.S., Kalman S., Lammel C.J., Fan J., Marathe R., Aravind L.,

RA Mitchell W.P., Olinger L., Tatusov R.L., Zhao Q., Koonin E.V.,

RA Davis R.W.; 

RT "Genome sequence of an obligate intracellular pathogen of humans: 

RT Chlamydia trachomatis."; 

RL Science 282:754-759(1998). 

CC -!- SIMILARITY: Belongs to the UPF0111 family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AE001339; AAC68286.1; -. 

DR PIR; G71483; G71483. 

DR InterPro; IPR002727; DUF47. 

DR Pfam; PF01865; PhoU_div; 1. 

DR TIGRFAMs; TIGR00153; TIGR00153; 1. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 224 AA; 25176 MW; 661EEB640DD1401E CRC64; 

Query Match 75.6%; Score 31; DB 1; Length 224; 

Best Local Similarity 71.4%; Pred. No. 21; 

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 

Qy          2 VFFAEDF 8
              |||::||
Db        180 VFFSDDF 186



RESULT 17 


UPK_CORST

ID UPK_CORST STANDARD; PRT; 281 AA.
AC Q9FB58;
DT 28-FEB-2003 (Rel. 41, Created)
DT 28-FEB-2003 (Rel. 41, Last sequence update)
DT 28-FEB-2003 (Rel. 41, Last annotation update)
DE Putative undecaprenol kinase (EC 2.7.1.66) (Bacitracin resistance
DE protein).
GN UPK OR BACA.
OS Corynebacterium striatum.
OG Plasmid pTP10.
OC Bacteria; Actinobacteria; Actinobacteridae; Actinomycetales;
OC Corynebacterineae; Corynebacteriaceae; Corynebacterium.
OX NCBI_TaxID=43770;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=M82B;
RX MEDLINE=20194806; PubMed=10732668;
RA Tauch A., Krieft S., Kalinowski J., Puehler A.;
RT "The 51,409-bp R-plasmid pTP10 from the multiresistant clinical
RT isolate Corynebacterium striatum M82B is composed of DNA segments
RT initially identified in soil bacteria and in plant, animal, and human
RT pathogens.";
RL Mol. Gen. Genet. 263:1-11(2000).
CC -!- FUNCTION: Probably phosphorylates undecaprenol to undecaprenyl
CC phosphate. Confers resistance to bacitracin (By similarity).
CC -!- CATALYTIC ACTIVITY: ATP + undecaprenol = ADP + undecaprenyl
CC phosphate.
CC -!- SUBCELLULAR LOCATION: Integral membrane protein (Probable).
CC -!- MISCELLANEOUS: Bacitracin is thought to be involved in inhibition
CC of peptidoglycan synthesis by sequestering undecaprenyl
CC diphosphate reducing the pool of lipid carrier available.
CC -!- SIMILARITY: Belongs to the upk family.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AF024666; AAG03365.1; -.
DR HAMAP; MF_01006; -; 1.
DR InterPro; IPR003824; BacA.
DR Pfam; PF02673; BacA; 1.
DR TIGRFAMs; TIGR00753; undec_kin_bacA; 1.
KW Transferase; Kinase; Antibiotic resistance; Transmembrane; Plasmid.
FT TRANSMEM 13 32 POTENTIAL.
FT TRANSMEM 52 71 POTENTIAL.
FT TRANSMEM 98 115 POTENTIAL.
FT TRANSMEM 125 142 POTENTIAL.
FT TRANSMEM 198 220 POTENTIAL.
FT TRANSMEM 230 252 POTENTIAL.
FT TRANSMEM 259 278 POTENTIAL.
SQ SEQUENCE 281 AA; 30512 MW; F82700075DB27EA9 CRC64;



Query Match 75.6%; Score 31; DB 1; Length 281; 

Best Local Similarity 85.7%; Pred. No. 26; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy          1 LVFFAED 7
              |||||:|
Db         67 LVFFAKD 73



RESULT 18 
YWBI_BACSU 

ID YWBI_BACSU STANDARD; PRT; 301 AA. 

AC P39592; 

DT 01-FEB-1995 (Rel. 31, Created) 

DT 01-FEB-1995 (Rel. 31, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Putative HTH-type transcriptional regulator ywbI.

GN YWBI OR IPA-24D OR BSU38310. 



OS Bacillus subtilis. 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 

OX NCBI_TaxID=1423; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=168; 

RX MEDLINE=95020537; PubMed=7934828;

RA Glaser P., Kunst F., Arnaud M., Coudart M.P., Gonzales W.,

RA Hullo M.F., Ionescu M., Lubochinsky B., Marcelino L., Moszer I.,

RA Presecan E., Santana M., Schneider E., Schweizer J., Vertes A.,

RA Rapoport G., Danchin A.; 

RT "Bacillus subtilis genome project: cloning and sequencing of the 97 

RT kb region from 325 degrees to 333 degrees."; 

RL Mol. Microbiol. 10:371-384(1993). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=168;

RX MEDLINE=98044033; PubMed=9384377;

RA Kunst F., Ogasawara N., Moszer I., Albertini A.M., Alloni G.,
RA Azevedo V., Bertero M.G., Bessieres P., Bolotin A., Borchert S.,
RA Borriss R., Boursier L., Brans A., Braun M., Brignell S.C., Bron S.,
RA Brouillet S., Bruschi C.V., Caldwell B., Capuano V., Carter N.M.,
RA Choi S.K., Codani J.J., Connerton I.F., Cummings N.J., Daniel R.A.,
RA Denizot F., Devine K.M., Dusterhoft A., Ehrlich S.D., Emmerson P.T.,
RA Entian K.D., Errington J., Fabret C., Ferrari E., Foulger D.,
RA Fritz C., Fujita M., Fujita Y., Fuma S., Galizzi A., Galleron N.,
RA Ghim S.Y., Glaser P., Goffeau A., Golightly E.J., Grandi G.,
RA Guiseppi G., Guy B.J., Haga K., Haiech J., Harwood C.R., Henaut A.,
RA Hilbert H., Holsappel S., Hosono S., Hullo M.F., Itaya M., Jones L.,
RA Joris B., Karamata D., Kasahara Y., Klaerr-Blanchard M., Klein C.,
RA Kobayashi Y., Koetter P., Koningstein G., Krogh S., Kumano M.,
RA Kurita K., Lapidus A., Lardinois S., Lauber J., Lazarevic V.,
RA Lee S.M., Levine A., Liu H., Masuda S., Mauel C., Medigue C.,
RA Medina N., Mellado R.P., Mizuno M., Moestl D., Nakai S., Noback M.,
RA Noone D., O'Reilly M., Ogawa K., Ogiwara A., Oudega B., Park S.H.,
RA Parro V., Pohl T.M., Portetelle D., Porwollik S., Prescott A.M.,
RA Presecan E., Pujic P., Purnelle B., Rapoport G., Rey M., Reynolds S.,
RA Rieger M., Rivolta C., Rocha E., Roche B., Rose M., Sadaie Y.,
RA Sato T., Scanlan E., Schleich S., Schroeter R., Scoffone F.,
RA Sekiguchi J., Sekowska A., Seror S.J., Serror P., Shin B.S., Soldo B.,
RA Sorokin A., Tacconi E., Takagi T., Takahashi H., Takemaru K.,
RA Takeuchi M., Tamakoshi A., Tanaka T., Terpstra P., Tognoni A.,
RA Tosato V., Uchiyama S., Vandenbol M., Vannier F., Vassarotti A.,
RA Viari A., Wambutt R., Wedler E., Wedler H., Weitzenegger T.,
RA Winters P., Wipat A., Yamamoto H., Yamane K., Yasumoto K., Yata K.,
RA Yoshida K., Yoshikawa H.F., Zumstein E., Yoshikawa H., Danchin A.;

RT "The complete genome sequence of the Gram-positive bacterium Bacillus 

RT subtilis."; 

RL Nature 390:249-256(1997). 

CC -!- SIMILARITY: Contains 1 HTH lysR-type DNA-binding domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X73124; CAA51580.1; -. 

DR EMBL; Z99123; CAB15857.1; -. 

DR PIR; S39679; S39679. 

DR SubtiList; BG10570; ywbI.

DR InterPro; IPR000847; HTH_LysR.

DR InterPro; IPR005119; LysR_subst.

DR Pfam; PF00126; HTH_1; 1.

DR Pfam; PF03466; LysR_substrate; 1.

DR PRINTS; PR00039; HTHLYSR. 

DR PROSITE; PS50931; HTH_LYSR; 1. 

KW Hypothetical protein; Transcription regulation; DNA-binding; 

KW Complete proteome. 

FT DOMAIN 1 58 HTH LYSR-TYPE. 

FT DNA_BIND 18 37 H-T-H MOTIF (POTENTIAL). 

SQ SEQUENCE 301 AA; 34677 MW; 63B06C0A7D3D30B9 CRC64;



Query Match 75.6%; Score 31; DB 1; Length 301;

Best Local Similarity 71.4%; Pred. No. 28;

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              :|| |||
Db        192 IFFREDF 198



RESULT 19 
MM14_PIG



ID MM14_PIG STANDARD; PRT; 580 AA. 

AC Q9XT90; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Matrix metalloproteinase-14 precursor (EC 3.4.24.80) (MMP-14) 

DE (Membrane-type matrix metalloproteinase 1) (MT-MMP 1) (MTMMP1) 

DE (Membrane-type-1 matrix metalloproteinase) (MT1-MMP) (MT1MMP).

GN MMP14.

OS Sus scrofa (Pig).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus. 

OX NCBI_TaxID=9823; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99095929; PubMed=9881602 ; 

RA Caron C., Xue J., Bartlett J.D.;

RT "Expression and localization of membrane type 1 matrix 

RT metalloproteinase in tooth tissues."; 

RL Matrix Biol. 17:501-511(1998). 

CC -!- FUNCTION: Seems to specifically activate progelatinase A. May thus 

CC trigger invasion by tumor cells by activating progelatinase A on 

CC the tumor cell surface (By similarity). May play a role in the

CC biomineralization of enamel and dentin.

CC -!- CATALYTIC ACTIVITY: Endopeptidase activity. Activates

CC progelatinase A by cleavage of the propeptide at 37-Asn-|-Leu-38.

CC Other bonds hydrolyzed include 35-Gly-|-Ile-36 in the propeptide

CC of collagenase 3, and 341-Asn-|-Phe-342, 441-Asp-|-Leu-442 and



CC 354-Gln-|-Thr-355 in the aggrecan interglobular domain.
CC -!- COFACTOR: Binds 1 zinc ion per subunit, calcium (By similarity).
CC -!- SUBCELLULAR LOCATION: Type I membrane protein (Potential).
CC -!- TISSUE SPECIFICITY: Highly expressed in developing tooth tissues
CC -!- SIMILARITY: Belongs to peptidase family M10A.
CC -!- SIMILARITY: Contains 1 hemopexin-like domain.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AF067419; AAD38324.1; -.
DR HSSP; P08254; 1HFS.
DR MEROPS; M10.014; -.
DR InterPro; IPR000585; Hemopexin.
DR InterPro; IPR001818; Pept_M10A_M12B.
DR InterPro; IPR006025; Pept_M_Zn_BS.
DR InterPro; IPR006026; Peptidase_M.
DR Pfam; PF00045; hemopexin; 4.
DR Pfam; PF00413; Peptidase_M10; 1.
DR Pfam; PF03933; Peptidase_M10_N; 1.
DR PRINTS; PR00138; MATRIXIN.
DR SMART; SM00120; HX; 4.
DR SMART; SM00235; ZnMc; 1.
DR PROSITE; PS00024; HEMOPEXIN; 1.
DR PROSITE; PS00142; ZINC_PROTEASE; 1.
DR PROSITE; PS00546; CYSTEINE_SWITCH; 1.
KW Hydrolase; Metalloprotease; Zinc; Zymogen; Calcium; Signal;
KW Transmembrane.
FT SIGNAL 1 28 POTENTIAL.
FT PROPEP 29 109 ACTIVATION PEPTIDE.
FT CHAIN 110 580 MATRIX METALLOPROTEINASE-14.
FT DOMAIN 110 539 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 540 560 POTENTIAL.
FT DOMAIN 561 580 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 314 509 HEMOPEXIN-LIKE.
FT SITE 91 91 CYSTEINE SWITCH (POTENTIAL).
FT METAL 237 237 ZINC (CATALYTIC) (BY SIMILARITY).
FT ACT_SITE 238 238 BY SIMILARITY.
FT METAL 241 241 ZINC (CATALYTIC) (BY SIMILARITY).
FT METAL 247 247 ZINC (CATALYTIC) (BY SIMILARITY).
FT DISULFID 317 506 BY SIMILARITY.
SQ SEQUENCE 580 AA; 65934 MW; B7B2C2C569A96CAC CRC64;



Query Match 75.6%; Score 31; DB 1; Length 580;

Best Local Similarity 62.5%; Pred. No. 54;

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              ::|||| |
Db        176 MIFFAEGF 183



RESULT 20
MM14_HUMAN

ID MM14_HUMAN STANDARD; PRT; 582 AA. 

AC P50281; Q92678; 

DT 01-OCT-1996 (Rel. 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Matrix metalloproteinase-14 precursor (EC 3.4.24.80) (MMP-14) 

DE (Membrane-type matrix metalloproteinase 1) (MT-MMP 1) (MTMMP1)

DE (Membrane-type-1 matrix metalloproteinase) (MT1-MMP) (MT1MMP) (MMP-

DE X1).

GN MMP14.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Placenta; 

RX MEDLINE=94286011; PubMed=8015608;

RA Sato H., Takino T., Okada Y., Cao J., Shinagawa A., Yamamoto E.,

RA Seiki M.;

RT "A matrix metalloproteinase expressed on the surface of invasive 

RT tumour cells."; 

RL Nature 370:61-65(1994). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Placenta; 

RX MEDLINE=95237627; PubMed=7721107;

RA Takino T., Sato H., Yamamoto E., Seiki M.;

RT "Cloning of a human gene potentially encoding a novel matrix 

RT metalloproteinase having a C-terminal transmembrane domain."; 

RL Gene 155:293-298(1995).

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=95224014; PubMed=7708715;

RA Okada A., Bellocq J.P., Rouyer N., Chenard M.P., Rio M.C.,

RA Chambon P., Basset P.; 

RT "Membrane-type matrix metalloproteinase (MT-MMP) gene is expressed in 

RT stromal cells of human colon, breast, and head and neck carcinomas."; 

RL Proc. Natl. Acad. Sci. U.S.A. 92:2730-2734(1995).

RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lung; 

RX MEDLINE=95377289; PubMed=7649159;

RA Will H., Hinzmann B.; 

RT "cDNA sequence and mRNA tissue distribution of a novel human matrix 

RT metalloproteinase with a potential transmembrane segment."; 

RL Eur. J. Biochem. 231:602-608(1995). 

RN [5] 

RP SEQUENCE FROM N.A. 

RA Luo G.X., Reisfeld R.A., Strongin A.Y.;

RL Submitted (NOV-1995) to the EMBL/GenBank/DDBJ databases.

RN [6] 

RP SEQUENCE FROM N.A. 

RA Lohi J.L., Westermarck J., Kaehaeri V.M., Keski-Oja J.;

RL Submitted (AUG-1996) to the EMBL/GenBank/DDBJ databases.



CC -!- FUNCTION: Seems to specifically activate progelatinase A. May thus 
CC trigger invasion by tumor cells by activating progelatinase A on 

CC the tumor cell surface.

CC -!- CATALYTIC ACTIVITY: Endopeptidase activity. Activates

CC progelatinase A by cleavage of the propeptide at 37-Asn-|-Leu-38.

CC Other bonds hydrolyzed include 35-Gly-|-Ile-36 in the propeptide

CC of collagenase 3, and 341-Asn-|-Phe-342, 441-Asp-|-Leu-442 and

CC 354-Gln-|-Thr-355 in the aggrecan interglobular domain.

CC -!- COFACTOR: Binds 1 zinc ion per subunit, calcium (By similarity).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (Potential).

CC -!- TISSUE SPECIFICITY: In stromal cells of colon, breast, and head

CC and neck.

CC -!- SIMILARITY: Belongs to peptidase family M10A.

CC -!- SIMILARITY: Contains 1 hemopexin-like domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; D26512; BAA05519.1; -. 

DR EMBL; X83535; CAA58519.1; -. 

DR EMBL; Z48481; CAA88372.1; -. 

DR EMBL; U41078; AAA83770.1; -. 

DR EMBL; X90925; CAA62432.1; -. 

DR PIR; I38028; I38028.

DR PDB; 1BQQ; 23-AUG-01. 

DR PDB; 1BUV; 02-SEP-99. 

DR MEROPS; M10.014; -.

DR Genew; HGNC:7160; MMP14. 

DR MIM; 600754; -. 

DR GO; GO:0005887; C:integral to plasma membrane; TAS.

DR GO; GO:0004222; F:metalloendopeptidase activity; TAS.

DR GO; GO:0008270; F:zinc ion binding; TAS.

DR GO; GO:0006508; P:proteolysis and peptidolysis; TAS.

DR InterPro; IPR000585; Hemopexin.

DR InterPro; IPR001818; Pept_M10A_M12B.

DR InterPro; IPR006025; Pept_M_Zn_BS.

DR InterPro; IPR006026; Peptidase_M.

DR Pfam; PF00045; hemopexin; 4.

DR Pfam; PF00413; Peptidase_M10; 1.

DR Pfam; PF03933; Peptidase_M10_N; 1.

DR PRINTS; PR00138; MATRIXIN.

DR SMART; SM00120; HX; 4. 

DR SMART; SM00235; ZnMc; 1. 

DR PROSITE; PS00024; HEMOPEXIN; 1. 

DR PROSITE; PS00142; ZINC_PROTEASE; 1. 

DR PROSITE; PS00546; CYSTEINE_SWITCH; 1.

KW Hydrolase; Metalloprotease; Zinc; Zymogen; Calcium; Signal;

KW Transmembrane; 3D-structure.

FT SIGNAL 1 20 POTENTIAL.

FT PROPEP 21 111 ACTIVATION PEPTIDE.

FT CHAIN 112 582 MATRIX METALLOPROTEINASE-14.

FT DOMAIN 112 541 EXTRACELLULAR (POTENTIAL).







FT TRANSMEM 542 562 POTENTIAL.

FT DOMAIN 563 582 CYTOPLASMIC (POTENTIAL).

FT DOMAIN 316 511 HEMOPEXIN-LIKE.

FT SITE 93 93 CYSTEINE SWITCH (POTENTIAL).

FT METAL 239 239 ZINC (CATALYTIC) (BY SIMILARITY).


FT ACT_SITE 240 240 BY SIMILARITY.

FT METAL 243 243 ZINC (CATALYTIC) (BY SIMILARITY).

FT METAL 249 249 ZINC (CATALYTIC) (BY SIMILARITY).

FT DISULFID 319 508 BY SIMILARITY.

FT CONFLICT 338 338 K -> E (IN REF. 2, 4, 5 AND 6).

FT CONFLICT 500 500 S -> P (IN REF. 6).

SQ SEQUENCE 582 AA; 65883 MW; EFCEDCE6A41116F9 CRC64;



Query Match 75.6%; Score 31; DB 1; Length 582; 

Best Local Similarity 62.5%; Pred. No. 55; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;



Qy   1 LVFFAEDF 8
       ::|||| |
Db 178 MIFFAEGF 185



RESULT 21 
MM14_RABIT

ID MM14_RABIT STANDARD; PRT; 582 AA.

AC Q95220; P79225; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Matrix metalloproteinase-14 precursor (EC 3.4.24.80) (MMP-14) 

DE (Membrane-type matrix metalloproteinase 1) (MT-MMP 1) (MTMMP1) 

DE (Membrane-type-1 matrix metalloproteinase) (MT1-MMP) (MT1MMP).

GN MMP14.

OS Oryctolagus cuniculus (Rabbit).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.

OX NCBI_TaxID=9986;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=New Zealand white; TISSUE=Vascular smooth muscle; 

RA Wang H., Keiser J.; 

RL Submitted (FEB-1997) to the EMBL/GenBank/DDBJ databases.

RN [2]

RP SEQUENCE OF 1-572 FROM N.A.

RC STRAIN=New Zealand white;

RA Sato T.;

RL Submitted (OCT-1996) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: Seems to specifically activate progelatinase A. May thus

CC trigger invasion by tumor cells by activating progelatinase A on

CC the tumor cell surface (By similarity).

CC -!- CATALYTIC ACTIVITY: Endopeptidase activity. Activates

CC progelatinase A by cleavage of the propeptide at 37-Asn-|-Leu-38.

CC Other bonds hydrolyzed include 35-Gly-|-Ile-36 in the propeptide

CC of collagenase 3, and 341-Asn-|-Phe-342, 441-Asp-|-Leu-442 and

CC 354-Gln-|-Thr-355 in the aggrecan interglobular domain.

CC -!- COFACTOR: Binds 1 zinc ion per subunit, calcium (By similarity). 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (Potential). 



CC -!- SIMILARITY: Belongs to peptidase family M10A.

CC -!- SIMILARITY: Contains 1 hemopexin-like domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U83918; AAB41500.1; -. 

DR EMBL; U73940; AAD13803.1; -.

DR HSSP; P08254; 1HFS.

DR MEROPS; M10.014; -.

DR InterPro; IPR000585; Hemopexin.

DR InterPro; IPR001818; Pept_M10A_M12B.

DR InterPro; IPR006025; Pept_M_Zn_BS.

DR InterPro; IPR006026; Peptidase_M. 

DR Pfam; PF00045; hemopexin; 4. 



DR Pfam; PF00413; Peptidase_M10; 1.

DR Pfam; PF03933; Peptidase_M10_N; 1.

DR PRINTS; PR00138; MATRIXIN.

DR SMART; SM00120; HX; 4.

DR SMART; SM00235; ZnMc; 1.

DR PROSITE; PS00024; HEMOPEXIN; 1.

DR PROSITE; PS00142; ZINC_PROTEASE; 1.

DR PROSITE; PS00546; CYSTEINE_SWITCH; 1.

KW Hydrolase; Metalloprotease; Zinc; Zymogen; Calcium; Signal;

KW Transmembrane.






FT SIGNAL 1 20 POTENTIAL.

FT PROPEP 21 111 ACTIVATION PEPTIDE.

FT CHAIN 112 582 MATRIX METALLOPROTEINASE-14.

FT DOMAIN 112 541 EXTRACELLULAR (POTENTIAL).

FT TRANSMEM 542 562 POTENTIAL.

FT DOMAIN 563 582 CYTOPLASMIC (POTENTIAL).

FT DOMAIN 316 511 HEMOPEXIN-LIKE.

FT SITE 93 93 CYSTEINE SWITCH (POTENTIAL).

FT METAL 239 239 ZINC (CATALYTIC) (BY SIMILARITY).

FT ACT_SITE 240 240 BY SIMILARITY.

FT METAL 243 243 ZINC (CATALYTIC) (BY SIMILARITY).

FT METAL 249 249 ZINC (CATALYTIC) (BY SIMILARITY).

FT DISULFID 319 508 BY SIMILARITY.

FT CONFLICT 29 29 Q -> K (IN REF. 2).

FT CONFLICT 268 268 K -> N (IN REF. 2).

FT CONFLICT 270 270 L -> V (IN REF. 2).

FT CONFLICT 275 275 E -> D (IN REF. 2).

FT CONFLICT 292 296 RCLLN -> KMPPP (IN REF. 2).

FT CONFLICT 298 300 GQP -> RTT (IN REF. 2).

FT CONFLICT 302 308 GLLFRIS -> RTFIPDK (IN REF. 2).

FT CONFLICT 310 310 G -> R (IN REF. 2).

FT CONFLICT 317 317 K -> N (IN REF. 2).

FT CONFLICT 329 329 F -> L (IN REF. 2).

FT CONFLICT 360 360 L -> F (IN REF. 2).

SQ SEQUENCE 582 AA; 65963 MW; 844624B0AF1B6812 CRC64;



Query Match 75.6%; Score 31; DB 1; Length 582;

Best Local Similarity 62.5%; Pred. No. 55;

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy   1 LVFFAEDF 8
       ::|||| |
Db 178 MIFFAEGF 185



RESULT 22 




iKl 1 






ID 


VDT 1 1 Zl IT T? T ^T 1 AMD • DDT t (COO Zi A 




AC 


ni n n A A - 

y x u u 4 ft , 




DT 01-OCT-1996 (Rel. 34, Created)

DT 01-OCT-1996 (Rel. 34, Last sequence update)

DT 01-NOV-1997 (Rel. 35, Last annotation update)

DE Hypothetical 71.6 kDa protein T07A5.1 in chromosome III.

GN T07A5.1.

OS Caenorhabditis elegans.

OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea;

OC Rhabditidae; Peloderinae; Caenorhabditis.

OX NCBI_TaxID=6239;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=Bristol N2;

RA Buck D.;

RL Submitted (FEB-1995) to the EMBL/GenBank/DDBJ databases.

CC -!- SIMILARITY: IN THE C-TERMINAL, TO C. ELEGANS W02B3.4 AND W02B3.5.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; Z48055; CAA88133.1; -.

DR PIR; T24632; T24632.

DR WormPep; T07A5.1; CE01646.

DR InterPro; IPR007074; LicD.

DR Pfam; PF04991; LicD; 1.

KW Hypothetical protein.

FT DOMAIN 5 8 POLY-LEU.

FT DOMAIN 148 154 POLY-SER.

SQ SEQUENCE 622 AA; 71623 MW; D3E4BDAB67DC3AD0 CRC64;





Query Match 75.6%; Score 31; DB 1; Length 622; 

Best Local Similarity 75.0%; Pred. No. 58; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy   1 LVFFAEDF 8
       |  |||||
Db 514 LAMFAEDF 521



RESULT 23 
MTN2_HUMAN

ID MTN2_HUMAN STANDARD; PRT; 956 AA.

AC O00339; Q9NSZ1;

DT 30-MAY-2000 (Rel. 39, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Matrilin-2 precursor. 

GN MATN2.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS LONG AND SHORT).

RX PubMed=11124542;

RA Muratoglu S., Krysan K., Balazs M., Sheng H., Zakany R., Modis L.,

RA Kiss I., Deak F.;

RT "Primary structure of human matrilin-2, chromosome location of the

RT MATN2 gene and conservation of an AT-AC intron in matrilin genes.";

RL Cytogenet. Cell Genet. 90:323-327(2000). 

RN [2] 

RP SEQUENCE OF 644-956 FROM N.A. 

RX MEDLINE=97238863; PubMed=9083061;

RA Deak F., Piecha D., Bachrati C., Paulsson M., Kiss I.;

RT "Primary structure and expression of matrilin-2, the closest relative

RT of cartilage matrix protein within the von Willebrand factor type A-

RT like module superfamily.";

RL J. Biol. Chem. 272:9268-9274(1997). 

RN [3] 

RP SEQUENCE OF 244-956 FROM N.A. 

RC TISSUE=Testis;

RA Duesterhoeft A., Lauber J., Mewes H.-W., Gassenhuber J., Wiemann S.;

RL Submitted (JAN-2000) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Involved in matrix assembly (By similarity).

CC -!- SUBCELLULAR LOCATION: Secreted.

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Name=Long;

CC IsoId=O00339-1; Sequence=Displayed;

CC Name=Short;

CC IsoId=O00339-2; Sequence=VSP_001399;

CC -!- SIMILARITY: Contains 10 EGF-like domains. 

CC -!- SIMILARITY: Contains 2 VWFA domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U69263; AAC51260.2; -. 

DR EMBL; AL137638; CAB70853.1; ALT_INIT. 

DR HSSP; P01132; 1EGF. 

DR Genew; HGNC:6908; MATN2.

DR MIM; 602108; -.

DR GO; GO:0005578; C:extracellular matrix; NAS.



DR InterPro; IPR000152; Asx_hydroxyl_S.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR InterPro; IPR002035; VWF_A.

DR Pfam; PF00008; EGF; 10.

DR Pfam; PF00092; vwa; 2.

DR PRINTS; PR00453; VWFADOMAIN.

DR SMART; SM00179; EGF_CA; 1.

DR SMART; SM00327; VWA; 2.

DR PROSITE; PS00010; ASX_HYDROXYL; 9.

DR PROSITE; PS01186; EGF_2; 9.

DR PROSITE; PS50026; EGF_3; 4.

DR PROSITE; PS50234; VWFA; 2.

KW EGF-like domain; Signal; Glycoprotein; Repe

KW Alternative splicing.
SIGNAL
FT


T*\ jf TV TIT 

DOMAIN 


c 0 c 

ozo 


r /-c 
OO J 




EGF
EGF
FT
0 3D
FT


UI bULr IU 




ZOO 




BY 


SIMILARITY. 


FT 


UloULt 1U 


O /I Q 

z 4 y 


O £ O 

z bz 




BY 


SIMILARITY. 


t — im 

FT 


UloUbr 1U 


Z b4 


Z / / 




BY 


SIMILARITY. 


t — im 

FT 


UlbU Lit ID 


Zoo 


z y 4 




BY
SIMILARITY.
346
SIMILARITY.


FT
BY
FT


DISULFID
Trim


DISULFID
BY
h 1
1 II 1 1
DISULFID
DISULFID
FT
SIMILARITY.
DISULFID
467
DISULFID
482
FT


DISULFID
499
SIMILARITY.


FT DISULFID 495 508 BY SIMILARITY.


FT 


DISULFID
523




BY 


SIMILARITY. 


FT DISULFID 529 540 BY SIMILARITY.

FT DISULFID 536 549 BY SIMILARITY.

FT DISULFID 551 564 BY SIMILARITY.

FT DISULFID 570 581 BY SIMILARITY.

FT DISULFID 577 590 BY SIMILARITY.

FT DISULFID 592 605 BY SIMILARITY.



r 1 


Dl bULr 11J 


Oil 


£9 9 
D^i ^ 


r> v Q TTvTT T ADTTV 




T-'III 

r l 


nTOTTT T?T n 
UlbULr 1 iJ
OJ1


"DV CTTVfTT 7A r> T T 1 V 




L'l 1 1
UlbULr ID






"DV CTMTT ADTTV 

di b IJyii J_iAK± 1 i . 




FT 


CARBOHYD 






N — LlJNJKbU ( bJ-iLNAL. . 


. J ( trUl CjN I 1AL ) 


i ii 1 1 
FT 


<*** TV T"i T"> /MTVrv 

CARBOHYD 


o o n 

o y u 


o o n 

o y U 


N-LlNKbD (GLCNAC 


. ) ( FOI bN I 1AL ) 


FT 


VARSPLIC 


o bl 


o / y 


Missing (in isoform Short).


FT 








/FTId=VSP_001399.




FT CONFLICT 356 356 K -> E (IN REF. 3).

FT CONFLICT 594 594 V -> E (IN REF. 3).

FT CONFLICT 644 644 R -> G (IN REF. 2).

FT CONFLICT 755 755 F -> L (IN REF. 3).

FT CONFLICT 935 935 L -> F (IN REF. 2).

SQ SEQUENCE 956 AA; 106840 MW; 826B7F347178FC80 CRC64;



Query Match 75.6%; Score 31; DB 1; Length 956; 

Best Local Similarity 71.4%; Pred. No. 90; 

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 



Qy   2 VFFAEDF 8
       :|:||||
Db 814 LFYAEDF 820



RESULT 24 
MTN2_MOUSE 

ID MTN2_MOUSE STANDARD; PRT; 956 AA. 

AC O08746;

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Matrilin-2 precursor. 

GN MATN2.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; TISSUE=Limb; 

RX MEDLINE=97238863; PubMed=9083061;

RA Deak F., Piecha D., Bachrati C., Paulsson M., Kiss I.;

RT "Primary structure and expression of matrilin-2, the closest relative

RT of cartilage matrix protein within the von Willebrand factor type A-

RT like module superfamily.";

RL J. Biol. Chem. 272:9268-9274(1997). 

CC -!- FUNCTION: Involved in matrix assembly (By similarity). 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- TISSUE SPECIFICITY: Detected in a variety of organs, including 
CC calvaria, uterus, heart and brain, as well as fibroblast and 

CC osteoblast cell lines. 

CC -!- SIMILARITY: Contains 10 EGF-like domains. 

CC -!- SIMILARITY: Contains 2 VWFA domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 



CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; U69262; AAC53163.1; -.
DR HSSP; P05099; 1AQ5.
DR MGD; MGI:109613; Matn2.
DR GO; GO:0005578; C:extracellular matrix; IDA.
DR InterPro; IPR000152; Asx_hydroxyl_S.
DR InterPro; IPR001881; EGF_Ca.
DR InterPro; IPR006209; EGF_like.
DR InterPro; IPR006210; IEGF.
DR InterPro; IPR002035; VWF_A.
DR Pfam; PF00008; EGF; 10.
DR Pfam; PF00092; vwa; 2.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00181; EGF; 10.
DR SMART; SM00327; VWA; 2.
DR PROSITE; PS00010; ASX_HYDROXYL; 7.
DR PROSITE; PS01186; EGF_2; 9.
DR PROSITE; PS50234; VWFA; 2.
KW EGF-like domain; Signal; Glycoprotein; Repeat; Coiled coil.
FT SIGNAL 1 23 POTENTIAL.
FT CHAIN 24 956 MATRILIN-2.
FT DOMAIN 57 232 VWFA 1.
FT DOMAIN 238 278 EGF-LIKE 1.
FT DOMAIN 279 319 EGF-LIKE 2.
FT DOMAIN 320 360 EGF-LIKE 3.
FT DOMAIN 361 401 EGF-LIKE 4.
FT DOMAIN 402 442 EGF-LIKE 5.
FT DOMAIN 443 483 EGF-LIKE 6.
FT DOMAIN 484 524 EGF-LIKE 7.
FT DOMAIN 525 565 EGF-LIKE 8.
FT DOMAIN 566 606 EGF-LIKE 9.
FT DOMAIN 607 647 EGF-LIKE 10.
FT DOMAIN 655 830 VWFA 2.
FT DOMAIN 917 955 COILED COIL (POTENTIAL).
FT DISULFID 242 253 BY SIMILARITY.
FT DISULFID 249 262 BY SIMILARITY.
FT DISULFID 264 277 BY SIMILARITY.
FT DISULFID 283 294 BY SIMILARITY.
FT DISULFID 290 303 BY SIMILARITY.
FT DISULFID 305 318 BY SIMILARITY.
FT DISULFID 324 335 BY SIMILARITY.
FT DISULFID 331 344 BY SIMILARITY.
FT DISULFID 346 359 BY SIMILARITY.
FT DISULFID 365 376 BY SIMILARITY.
FT DISULFID 372 385 BY SIMILARITY.
FT DISULFID 387 400 BY SIMILARITY.
FT DISULFID 406 417 BY SIMILARITY.
FT DISULFID 413 426 BY SIMILARITY.
FT DISULFID 428 441 BY SIMILARITY.
FT DISULFID 447 458 BY SIMILARITY.
FT DISULFID 454 467 BY SIMILARITY.
FT DISULFID 469 482 BY SIMILARITY.
FT DISULFID 488 499 BY SIMILARITY.
FT DISULFID 495 508 BY SIMILARITY.
FT


DISULFID 


592 


605 


BY
BY
FT
BY


SIMILARITY. 




FT 


DISULFID
BY


SIMILARITY. 




FT
N-LINKED (GLCNAC...) (POTENTIAL).


FT 


CARBOHYD
890


N-LINKED (GLCNAC...) (POTENTIAL).


SQ SEQUENCE 956 AA; 106779 MW; 3E4C22770B6E6EEE CRC64;

Query Match 75.6%; Score 31; DB 1; Length 956;



Best Local Similarity 71.4%; Pred. No. 90; 

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 

Qy   2 VFFAEDF 8
       :|:||||
Db 814 LFYAEDF 820



RESULT 25 
FAB1_SCHPO

ID FAB1_SCHPO STANDARD; PRT; 1932 AA.

AC O59722; O43072;

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Phosphatidylinositol-4-phosphate 5-kinase fab1 (EC 2.7.1.68) (1-

DE phosphatidylinositol-4-phosphate kinase) (PIP5K) (PtdIns(4)P-5-kinase)

DE (Diphosphoinositide kinase).

GN FAB1 OR STE12 OR SPBC3E7.01 OR SPBC6B1.11C.

OS Schizosaccharomyces pombe (Fission yeast).

OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes;

OC Schizosaccharomycetales; Schizosaccharomycetaceae;

OC Schizosaccharomyces.

OX NCBI_TaxID=4896;

RN [1] 

RP SEQUENCE FROM N.A., FUNCTION, AND SUBCELLULAR LOCATION. 

RX MEDLINE=21893055; PubMed=11895483;

RA Morishita M., Morimoto F., Kitamura K., Koga T., Fukui Y., Maekawa H.,

RA Yamashita I., Shimoda C.;

RT "Phosphatidylinositol 3-phosphate 5-kinase is required for the 

RT cellular response to nutritional starvation and mating pheromone 

RT signals in Schizosaccharomyces pombe."; 

RL Genes Cells 7:199-215(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=972; 

RX MEDLINE=21848401; PubMed=11859360;

RA Wood V., Gwilliam R., Rajandream M.A., Lyne M., Lyne R., Stewart A.,

RA Sgouros J., Peat N., Hayles J., Baker S., Basham D., Bowman S.,

RA Brooks K., Brown D., Brown S., Chillingworth T., Churcher C.M.,

RA Collins M., Connor R., Cronin A., Davis P., Feltwell T., Fraser A.,

RA Gentles S., Goble A., Hamlin N., Harris D., Hidalgo J., Hodgson G.,






RA Holroyd S., Hornsby T., Howarth S., Huckle E.J., Hunt S., Jagels K.,

RA James K., Jones L., Jones M., Leather S., McDonald S., McLean J.,

RA Mooney P., Moule S., Mungall K., Murphy L., Niblett D., Odell C.,

RA Oliver K., O'Neil S., Pearson D., Quail M.A., Rabbinowitsch E.,

RA Rutherford K., Rutter S., Saunders D., Seeger K., Sharp S.,

RA Skelton J., Simmonds M., Squares R., Squares S., Stevens K.,

RA Taylor K., Taylor R.G., Tivey A., Walsh S.V., Warren T., Whitehead S.,

RA Woodward J., Volckaert G., Aert R., Robben J., Grymonprez B.,

RA Weltjens I., Vanstreels E., Rieger M., Schaefer M., Mueller-Auer S.,

RA Gabel C., Fuchs M., Fritzc C., Holzer E., Moestl D., Hilbert H.,

RA Borzym K., Langer I., Beck A., Lehrach H., Reinhardt R., Pohl T.M.,

RA Eger P., Zimmermann W., Wedler H., Wambutt R., Purnelle B.,

RA Goffeau A., Cadieu E., Dreano S., Gloux S., Lelaure V., Mottier S.,

RA Galibert F., Aves S.J., Xiang Z., Hunt C., Moore K., Hurst S.M.,

RA Lucas M., Rochet M., Gaillardin C., Tallada V.A., Garzon A., Thode G.,

RA Daga R.R., Cruzado L., Jimenez J., Sanchez M., del Rey F., Benito J.,

RA Dominguez A., Revuelta J.L., Moreno S., Armstrong J., Forsburg S.L.,

RA Cerrutti L., Lowe T., McCombie W.R., Paulsen I., Potashkin J.,

RA Shpakovski G.V., Ussery D., Barrell B.G., Nurse P.;

RT "The genome sequence of Schizosaccharomyces pombe.";

RL Nature 415:871-880(2002).

CC -!- FUNCTION: Catalyzes the phosphorylation of phosphatidylinositol-4-
CC phosphate on the fifth hydroxyl of the myo-inositol ring, to form

CC phosphatidylinositol-4,5-bisphosphate. Required for survival under

CC conditions of nitrogen starvation. May have a role in the

CC secretion of pheromone peptides.

CC -!- CATALYTIC ACTIVITY: ATP + 1-phosphatidyl-1D-myo-inositol 4-
CC monophosphate = ADP + 1-phosphatidyl-1D-myo-inositol 4,5-

CC bisphosphate.

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. During cell fusion of opposite
CC mating types, migrates to the isthmus.

CC -!- SIMILARITY: BELONGS TO THE PTDINS(4)P-5-KINASE FAMILY.

CC -!- SIMILARITY: Contains 1 FYVE-type zinc finger. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AL023534; CAA19005.1; -.

DR EMBL; AL021838; CAA17054.1; -.

DR PIR; T40375; T40375.

DR GeneDB_SPombe; SPBC3E7.01; -.

DR GO; GO:0006629; P:lipid metabolism; ISS.

DR GO; GO:0007165; P:signal transduction; ISS.

DR InterPro; IPR002498; PIP5K. 

DR InterPro; IPR000306; Znf_FYVE. 

DR Pfam; PF01363; FYVE; 1. 

DR SMART; SM00064; FYVE; 1. 

DR SMART; SM00330; PIPKc; 1. 

DR PROSITE; PS50178; ZF_FYVE; 1. 

KW Transferase; Kinase; Zinc-finger; Zinc. 

FT ZN_FING 60 119 FYVE-TYPE. 

SQ SEQUENCE 1932 AA; 220132 MW; B9826516D4504229 CRC64; 



Query Match 75.6%; Score 31; DB 1; Length 1932; 

Best Local Similarity 85.7%; Pred. No. 1.8e+02;

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy    2 VFFAEDF 8
        ||||| |
Db 1662 VFFAEQF 1668



RESULT 26 
MOR2_SCHPO 

ID MOR2_SCHPO STANDARD; PRT; 2196 AA. 

AC Q9HDV6; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Cell polarity protein mor2 (Morphological round protein 2). 

GN MOR2 OR SPBP19A11.04C.

OS Schizosaccharomyces pombe (Fission yeast).

OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes;

OC Schizosaccharomycetales; Schizosaccharomycetaceae;

OC Schizosaccharomyces.

OX NCBI_TaxID=4896; 

RN [1] 

RP SEQUENCE FROM N.A., FUNCTION, AND SUBCELLULAR LOCATION. 

RC STRAIN=972; 

RX MEDLINE=22220058; PubMed=12234926;

RA Hirata D., Kishimoto N., Suda M., Sogabe Y., Nakagawa S., Yoshida Y.,

RA Sakai K., Mizunuma M., Miyakawa T., Ishiguro J., Toda T.;

RT "Fission yeast Mor2/Cps12, a protein similar to Drosophila Furry, is

RT essential for cell morphogenesis and its mutation induces

RT Wee1-dependent G2 delay.";

RL EMBO J. 21:4863-4874(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=972; 

RX MEDLINE=21848401; PubMed=11859360;

RA Wood V., Gwilliam R., Rajandream M.A., Lyne M., Lyne R., Stewart A.,

RA Sgouros J., Peat N., Hayles J., Baker S., Basham D., Bowman S.,

RA Brooks K., Brown D., Brown S., Chillingworth T., Churcher C.M.,

RA Collins M., Connor R., Cronin A., Davis P., Feltwell T., Fraser A.,

RA Gentles S., Goble A., Hamlin N., Harris D., Hidalgo J., Hodgson G.,

RA Holroyd S., Hornsby T., Howarth S., Huckle E.J., Hunt S., Jagels K.,

RA James K., Jones L., Jones M., Leather S., McDonald S., McLean J.,

RA Mooney P., Moule S., Mungall K., Murphy L., Niblett D., Odell C.,

RA Oliver K., O'Neil S., Pearson D., Quail M.A., Rabbinowitsch E.,

RA Rutherford K., Rutter S., Saunders D., Seeger K., Sharp S.,

RA Skelton J., Simmonds M., Squares R., Squares S., Stevens K.,

RA Taylor K., Taylor R.G., Tivey A., Walsh S.V., Warren T., Whitehead S.,

RA Woodward J., Volckaert G., Aert R., Robben J., Grymonprez B.,

RA Weltjens I., Vanstreels E., Rieger M., Schaefer M., Mueller-Auer S.,

RA Gabel C., Fuchs M., Fritzc C., Holzer E., Moestl D., Hilbert H.,

RA Borzym K., Langer I., Beck A., Lehrach H., Reinhardt R., Pohl T.M.,

RA Eger P., Zimmermann W., Wedler H., Wambutt R., Purnelle B.,

RA Goffeau A., Cadieu E., Dreano S., Gloux S., Lelaure V., Mottier S.,

RA Galibert F., Aves S.J., Xiang Z., Hunt C., Moore K., Hurst S.M.,

RA Lucas M., Rochet M., Gaillardin C., Tallada V.A., Garzon A., Thode G.,

RA Daga R.R., Cruzado L., Jimenez J., Sanchez M., del Rey F., Benito J.,

RA Dominguez A., Revuelta J.L., Moreno S., Armstrong J., Forsburg S.L.,

RA Cerrutti L., Lowe T., McCombie W.R., Paulsen I., Potashkin J.,

RA Shpakovski G.V., Ussery D., Barrell B.G., Nurse P.;

RT "The genome sequence of Schizosaccharomyces pombe."; 

RL Nature 415:871-880(2002). 

CC -!- FUNCTION: Required for the maintenance of cell polarity. Has a 
CC role in localizing F-actin at the cell tips. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic and membrane-associated; found 
CC at the cell tips and at the site of septum formation. 

CC -!- SIMILARITY: TO YEAST TAO3/PAG1.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AB093007; BAC20935.1; -.

DR EMBL; AL512495; CAC19754.1; -.

DR GeneDB_SPombe; SPBP19A11.04c; -.

SQ SEQUENCE 2196 AA; 251195 MW; AAF2698DB00DE4CF CRC64;

Query Match 75.6%; Score 31; DB 1; Length 2196; 

Best Local Similarity 100.0%; Pred. No. 2.1e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    2 VFFAED 7
        ||||||
Db 2014 VFFAED 2019



RESULT 27 




Y681_CHLPN

ID Y681_CHLPN STANDARD; PRT; 224 AA.

AC Q9Z7M3; Q9JQF7;

DT 30-MAY-2000 (Rel. 39, Created)

DT 30-MAY-2000 (Rel. 39, Last sequence update)

DT 10-OCT-2003 (Rel. 42, Last annotation update)

DE Protein CPn0681/CP0066/CPj0681/CpB0708.

GN CPN0681 OR CP0066 OR CPJ0681 OR CPB0708.

OS Chlamydia pneumoniae (Chlamydophila pneumoniae).

OC Bacteria; Chlamydiae; Chlamydiales; Chlamydiaceae; Chlamydophila.

OX NCBI_TaxID=83558;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=CWL029;

RX MEDLINE=99206606; PubMed=10192388;

RA Kalman S., Mitchell W., Marathe R., Lammel C., Fan J., Hyman R.W.,

RA Olinger L., Grimwood J., Davis R.W., Stephens R.S.;

RT "Comparative genomes of Chlamydia pneumoniae and C. trachomatis.";

RL Nat. Genet. 21:385-389(1999).

RN [2]

RP SEQUENCE FROM N.A.





RC STRAIN=AR39;

RX MEDLINE=20150255; PubMed=10684935;

RA Read T.D., Brunham R.C., Shen C., Gill S.R., Heidelberg J.F.,

RA White O., Hickey E.K., Peterson J., Utterback T., Berry K., Bass S.,

RA Linher K., Weidman J., Khouri H., Craven B., Bowman C., Dodson R.,

RA Gwinn M., Nelson W., DeBoy R., Kolonay J., McClarty G., Salzberg S.L.,

RA Eisen J., Fraser C.M.;

RT "Genome sequences of Chlamydia trachomatis MoPn and Chlamydia 

RT pneumoniae AR39."; 

RL Nucleic Acids Res. 28:1397-1406(2000). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=J138; 

RX MEDLINE=20330349; PubMed=10871362;

RA Shirai M., Hirakawa H., Kimoto M., Tabuchi M., Kishi F., Ouchi K.,

RA Shiba T., Ishii K., Hattori M., Kuhara S., Nakazawa T.;

RT "Comparison of whole genome sequences of Chlamydia pneumoniae J138 

RT from Japan and CWL029 from USA."; 

RL Nucleic Acids Res. 28:2311-2314(2000). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=TW-183;

RA Geng M.M., Schuhmacher A., Muehldorfer I., Bensch K.W., Schaefer K.P.,

RA Schneider S., Pohl T., Essig A., Marre R., Melchers K.;

RT "The genome sequence of Chlamydia pneumoniae TW183 and comparison with 

RT other Chlamydia strains based on whole genome sequence analysis."; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: Belongs to the UPF0111 family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AE001650; AAD18820.1; -. 

DR EMBL; AE002170; AAF37955.1; -.

DR EMBL; AP002547; BAA98888.1; -. 

DR EMBL; AE017159; AAP98637.1; -. 

DR PIR; E72049; E72049. 

DR PIR; F86575; F86575. 

DR PHCI-2DPAGE; Q9Z7M3; -. 

DR TIGR; CP0066; -. 

DR InterPro; IPR002727; DUF47. 

DR Pfam; PF01865; PhoU_div; 1.

DR TIGRFAMs; TIGR00153; TIGR00153; 1. 

KW Complete proteome. 

SQ SEQUENCE 224 AA; 25887 MW; 82D1AF044CD11EA0 CRC64; 

Query Match 73.2%; Score 30; DB 1; Length 224; 

Best Local Similarity 57.1%; Pred. No. 35; 

Matches 4; Conservative 3; Mismatches 0; Indels 0; Gaps 0; 



Qy          2 VFFAEDF 8
              :||::||
Db        180 IFFSDDF 186

RESULT 28
SYE_WIGBR

ID SYE_WIGBR STANDARD; PRT; 473 AA. 

AC Q8D375; 

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Glutamyl-tRNA synthetase (EC 6.1.1.17) (Glutamate--tRNA ligase)

DE (GluRS).

GN GLTX OR WIGBR1260.

OS Wigglesworthia glossinidia brevipalpis.

OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales;

OC Enterobacteriaceae; Wigglesworthia.

OX NCBI_TaxID=36870;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22297718; PubMed=12219091;

RA Akman L., Yamashita A., Watanabe H., Oshima K., Shiba T., Hattori M.,

RA Aksoy S.;

RT "Genome sequence of the endocellular obligate symbiont of tsetse

RT flies, Wigglesworthia glossinidia.";

RL Nat. Genet. 32:402-407(2002).

CC -!- CATALYTIC ACTIVITY: ATP + L-glutamate + tRNA(Glu) = AMP +

CC diphosphate + L-glutamyl-tRNA(Glu).

CC -!- SUBUNIT: Monomer (By similarity). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- SIMILARITY: Belongs to class-I aminoacyl-tRNA synthetase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AB063521; BAC24272.1; -.

DR HAMAP; MF_00022; -; 1. 

DR InterPro; IPR004527; GltX_bact. 

DR InterPro; IPR000924; Glu_tRNA-synt_1c.

DR InterPro; IPR008925; tRNA-synt_bind.

DR InterPro; IPR001412; tRNA-synt_I. 

DR Pfam; PF00749; tRNA-synt_1c; 1.

DR PRINTS; PR00987; TRNASYNTHGLU.

DR TIGRFAMs; TIGR00464; gltX_bact; 1.

DR PROSITE; PS00178; AA_TRNA_LIGASE_I; FALSE_NEG.

KW Aminoacyl-tRNA synthetase; Protein biosynthesis; Ligase; ATP-binding; 

KW Complete proteome. 

FT SITE 9 19 "HIGH" REGION. 

FT SITE 237 241 "KMSKS" REGION. 

FT BINDING 240 240 ATP (BY SIMILARITY).

SQ SEQUENCE 473 AA; 55832 MW; E20D10F9F7CFA7C9 CRC64;



Query Match 73.2%; Score 30; DB 1; Length 473;

Best Local Similarity 62.5%; Pred. No. 73;

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              ::|| |||
Db        361 ILFFYEDF 368

RESULT 29 
YQP4_CAEEL 

ID YQP4_CAEEL STANDARD; PRT; 529 AA.

AC Q09531; 

DT 01-OCT-1996 (Rel. 34, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Hypothetical protein F07F6.4 in chromosome III. 

GN F07F6.4. 

OS Caenorhabditis elegans.

OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea; 

OC Rhabditidae; Peloderinae; Caenorhabditis. 

OX NCBI_TaxID=6239;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Bristol N2;

RA Chissoe S.;

RL Submitted (JUL-1995) to the EMBL/GenBank/DDBJ databases.

RN [2]

RP REVISIONS.

RA Waterston R.;

RL Submitted (DEC-2001) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: GTPase-activating protein for the ADP ribosylation

CC factor family (Potential).

CC -!- SIMILARITY: Contains 1 Arf-GAP domain.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U23486; AAL38955.1; -. 

DR WormPep; F07F6.4; CE29955. 

DR InterPro; IPR001164; hRIP_like. 

DR Pfam; PF01412; ArfGap; 1. 

DR PRINTS; PR00405; REVINTRACTNG.

DR SMART; SM00105; ArfGap; 1. 

DR PROSITE; PS50115; ARFGAP; 1. 

KW Hypothetical protein; GTPase activation; Zinc-finger. 

FT DOMAIN 13 129 ARF-GAP.

FT ZN_FING 28 51 C4-TYPE.

FT DOMAIN 375 378 POLY-SER.

FT DOMAIN 404 411 POLY-SER. 

SQ SEQUENCE 529 AA; 57911 MW; FE67BC185F25ECC3 CRC64; 



Query Match 



73.2%; Score 30; DB 1; Length 529; 



Best Local Similarity 83.3%; Pred. No. 82; 

Matches 5; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy          3 FFAEDF 8
              |||:||
Db        153 FFAQDF 158



RESULT 30 
GRAU_DROME

ID GRAU_DROME STANDARD; PRT; 570 AA. 

AC Q9U405; Q9W2N7; 

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Transcription factor grauzone.

GN GRAU OR CG3282. 

OS Drosophila melanogaster (Fruit fly). 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;

OC Ephydroidea; Drosophilidae; Drosophila.

OX NCBI_TaxID=7227; 

RN [1] 

RP SEQUENCE FROM N.A., FUNCTION, TISSUE SPECIFICITY, AND MUTANTS QQ36 AND 

RP QE70. 

RX MEDLINE=20148536; PubMed=10683177;

RA Chen B., Harms E., Chu T., Henrion G., Strickland S.;

RT "Completion of meiosis in Drosophila oocytes requires transcriptional 

RT control by grauzone, a new zinc finger protein."; 

RL Development 127:1243-1251(2000). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; 

RX MEDLINE=20196006; PubMed=10731132;

RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,

RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,

RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,

RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X.,

RA Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D.,

RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G.,

RA Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D.,

RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M.,

RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S.,

RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P.,

RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I.,

RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P.,

RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M.,

RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P.,

RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W.,

RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K.,

RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,

RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J.,

RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C.,

RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A.,

RA Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z.,

RA Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X.,

RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D.,

RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A.,

RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L.,

RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M.,

RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G.,

RA Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H.,

RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.,

RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E.,

RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X.,

RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J.,

RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A.,

RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L.,

RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O.,

RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;

RT "The genome sequence of Drosophila melanogaster.";

RL Science 287:2185-2195(2000).

RN [3] 

RP REVISIONS. 

RX MEDLINE=22426069; PubMed=12537572;

RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,

RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,

RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,

RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,

RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q.,

RA Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M.,

RA Lewis S.E.;

RT "Annotation of the Drosophila melanogaster euchromatic genome: a

RT systematic review.";

RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; TISSUE=Head;

RX MEDLINE=22426066; PubMed=12537569;

RA Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M.,

RA George R.A., Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H.,

RA Rubin G.M., Celniker S.E.;

RT "A Drosophila full-length cDNA resource.";

RL Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002).

RN [5] 

RP FUNCTION, AND DNA-BINDING. 

RX MEDLINE=20384592; PubMed=10924478;

RA Harms E., Chu T., Henrion G., Strickland S.; 

RT "The only function of Grauzone required for Drosophila oocyte meiosis 

RT is transcriptional activation of the cortex gene."; 

RL Genetics 155:1831-1839(2000). 

CC -!- FUNCTION: Transcription factor involved in female meiosis. 

CC Specifically required to bind to the promoter region of the Cortex 

CC gene and activate its expression in ovaries. 

CC -!- SUBCELLULAR LOCATION: Nuclear. 

CC -!- TISSUE SPECIFICITY: Present throughout embryogenesis, in larvae,
CC in ovaries and other tissues of adult females, and in adult males. 

CC In ovaries, it is expressed in both follicle cells and nurse 

CC cells. Expressed from stage 9 in the germarium. Weakly or not 

CC expressed in the oocyte nucleus. 

CC -!- DEVELOPMENTAL STAGE: Expressed both maternally and zygotically. 

CC -!- SIMILARITY: Contains 7 C2H2-type zinc fingers. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 



CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF208016; AAF25356.1; -. 

DR EMBL; AE003452; AAF46653.2; -. 

DR EMBL; AY047537; AAK77269.1; -.

DR GermOnline; 209995; -.

DR FlyBase; FBgn0001133; grau. 

DR GO; GO:0005634; C:nucleus; IDA.

DR GO; GO:0003704; F:specific RNA polymerase II transcription fa...; IMP.

DR GO; GO:0007143; P:female meiosis; IMP.

DR InterPro; IPR007087; Znf_C2H2.

DR Pfam; PF00096; zf-C2H2; 8.

DR SMART; SM00355; ZnF_C2H2; 8.

DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 7.

DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 7.

KW Meiosis; DNA-binding; Transcription regulation; Activator;

KW Metal-binding; Zinc; Zinc-finger; Repeat.

FT ZN_FING 296 319 C2H2-TYPE 1. 

FT ZN_FING 327 349 C2H2-TYPE 2. 

FT ZN_FING 387 409 C2H2-TYPE 3.

FT ZN_FING 417 441 C2H2-TYPE 4. 

FT ZN_FING 448 471 C2H2-TYPE 5. 

FT ZN_FING 476 498 C2H2-TYPE 6. 

FT ZN_FING 504 527 C2H2-TYPE 7. 

FT DOMAIN 293 570 DNA-BINDING. 

FT DOMAIN 383 399 BIPARTITE NUCLEAR LOCALIZATION SIGNAL 1 

FT (POTENTIAL).

FT DOMAIN 523 539 BIPARTITE NUCLEAR LOCALIZATION SIGNAL 2

FT (POTENTIAL).

FT DOMAIN 146 172 ASP/GLU-RICH (ACIDIC).

FT MUTAGEN 298 298 C->Y: IN QQ36; DEFECTS IN FEMALE MEIOSIS. 

FT ABOLISHES DNA-BINDING ACTIVITY. 

FT MUTAGEN 493 493 E->K: IN QE70; DEFECTS IN FEMALE MEIOSIS.

FT ABOLISHES DNA-BINDING ACTIVITY. 

SQ SEQUENCE 570 AA; 65967 MW; 33B550B964C1EA7C CRC64; 



Query Match 73.2%; Score 30; DB 1; Length 570;

Best Local Similarity 71.4%; Pred. No. 88;

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              :| ||||
Db        246 IFLAEDF 252



RESULT 31 
LICR_BACSU 

ID LICR_BACSU STANDARD; PRT; 641 AA. 

AC P46321; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 01-NOV-1995 (Rel. 32, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 



DE Probable licABCH operon regulator. 

GN LICR OR CELR OR BSU38600. 

OS Bacillus subtilis. 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus.

OX NCBI_TaxID=1423; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=168; 

RX MEDLINE=97144536; PubMed=8990303;

RA Tobisch S., Glaser P., Krueger S., Hecker M.;

RT "Identification and characterization of a new beta-glucoside

RT utilization system in Bacillus subtilis.";

RL J. Bacteriol. 179:496-506(1997).

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=168 / BGSC1A1; 

RX MEDLINE=97124196; PubMed=8969509;

RA Yoshida K.-I., Shindo K., Sano H., Seki S., Fujimura M., Yanai N.,

RA Miwa Y., Fujita Y.;

RT "Sequencing of a 65 kb region of the Bacillus subtilis genome

RT containing the lic and cel loci, and creation of a 177 kb contig

RT covering the gnt-sacXY region."; 

RL Microbiology 142:3113-3123(1996). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=168; 

RX MEDLINE=98044033; PubMed=9384377;

RA Kunst F., Ogasawara N., Moszer I., Albertini A.M., Alloni G.,

RA Azevedo V., Bertero M.G., Bessieres P., Bolotin A., Borchert S.,

RA Borriss R., Boursier L., Brans A., Braun M., Brignell S.C., Bron S.,

RA Brouillet S., Bruschi C.V., Caldwell B., Capuano V., Carter N.M.,

RA Choi S.K., Codani J.J., Connerton I.F., Cummings N.J., Daniel R.A.,

RA Denizot F., Devine K.M., Dusterhoft A., Ehrlich S.D., Emmerson P.T.,

RA Entian K.D., Errington J., Fabret C., Ferrari E., Foulger D.,

RA Fritz C., Fujita M., Fujita Y., Fuma S., Galizzi A., Galleron N.,

RA Ghim S.Y., Glaser P., Goffeau A., Golightly E.J., Grandi G.,

RA Guiseppi G., Guy B.J., Haga K., Haiech J., Harwood C.R., Henaut A.,

RA Hilbert H., Holsappel S., Hosono S., Hullo M.F., Itaya M., Jones L.,

RA Joris B., Karamata D., Kasahara Y., Klaerr-Blanchard M., Klein C.,

RA Kobayashi Y., Koetter P., Koningstein G., Krogh S., Kumano M.,

RA Kurita K., Lapidus A., Lardinois S., Lauber J., Lazarevic V.,

RA Lee S.M., Levine A., Liu H., Masuda S., Mauel C., Medigue C.,

RA Medina N., Mellado R.P., Mizuno M., Moestl D., Nakai S., Noback M.,

RA Noone D., O'Reilly M., Ogawa K., Ogiwara A., Oudega B., Park S.H.,

RA Parro V., Pohl T.M., Portetelle D., Porwollik S., Prescott A.M.,

RA Presecan E., Pujic P., Purnelle B., Rapoport G., Rey M., Reynolds S.,

RA Rieger M., Rivolta C., Rocha E., Roche B., Rose M., Sadaie Y.,

RA Sato T., Scanlan E., Schleich S., Schroeter R., Scoffone F.,

RA Sekiguchi J., Sekowska A., Seror S.J., Serror P., Shin B.S., Soldo B.,

RA Sorokin A., Tacconi E., Takagi T., Takahashi H., Takemaru K.,

RA Takeuchi M., Tamakoshi A., Tanaka T., Terpstra P., Tognoni A.,

RA Tosato V., Uchiyama S., Vandenbol M., Vannier F., Vassarotti A.,

RA Viari A., Wambutt R., Wedler E., Wedler H., Weitzenegger T.,

RA Winters P., Wipat A., Yamamoto H., Yamane K., Yasumoto K., Yata K.,

RA Yoshida K., Yoshikawa H.F., Zumstein E., Yoshikawa H., Danchin A.;

RT "The complete genome sequence of the Gram-positive bacterium Bacillus 

RT subtilis."; 



RL Nature 390:249-256(1997). 

RN [4] 

RP MUTAGENESIS OF HIS-219; HIS-278; HIS-333; HIS-392 AND HIS-559. 

RX MEDLINE=99369874; PubMed=10438772 ; 

RA Tobisch S., Stuelke J., Hecker M.;

RT "Regulation of the lic operon of Bacillus subtilis and

RT characterization of potential phosphorylation sites of the LicR 

RT regulator protein by site-directed mutagenesis."; 

RL J. Bacteriol. 181:4995-5003(1999). 

CC -!- FUNCTION: Positive regulator of the licABCH operon. 

CC -!- MISCELLANEOUS: Might be activated by phosphorylation of certain 

CC histidyl residues by HPr. 

CC -!- SIMILARITY: Contains 1 PTS EIIA domain. 

CC -!- SIMILARITY: Belongs to the transcriptional antiterminator bglG 
CC family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

DR EMBL; Z49992; CAA90284.1; -. 

DR EMBL; D83026; BAA11742.1; -. 

DR EMBL; Z99123; CAB15886.1; -.

DR PIR; H69651; H69651. 

DR SubtiList; BG11346; licR. 

DR InterPro; IPR001550; BglG_antitermin.

DR InterPro; IPR002178; PTS_EIIA_2. 

DR Pfam; PF00874; PRD; 2. 

DR Pfam; PF00359; PTS_EIIA_2; 1. 

DR ProDom; PD001689; PTS_EIIA_2; 1. 

DR PROSITE; PS00654; ANTITERMINATORS_BGLG; FALSE_NEG.

KW Transcription regulation; Activator; RNA-binding; Phosphorylation; 

KW Complete proteome. 



FT DOMAIN 499 638 EIIA DOMAIN.

FT MOD_RES 219 219 PHOSPHORYLATION (POTENTIAL)

FT MOD_RES 278 278 PHOSPHORYLATION (POTENTIAL)

FT MOD_RES 333 333 PHOSPHORYLATION (POTENTIAL)

FT MOD_RES 392 392 PHOSPHORYLATION (POTENTIAL)

FT MOD_RES 559 559 PHOSPHORYLATION (POTENTIAL)

FT MUTAGEN 219 219 H->D: LOSS OF ACTIVITY.

FT MUTAGEN 219 219 H->A: RESIDUAL ACTIVITY.

FT MUTAGEN 278 278 H->E: LOSS OF ACTIVITY.

FT MUTAGEN 278 278 H->A: LOSS OF ACTIVITY.

FT MUTAGEN 333 333 H->A: LOSS OF ACTIVITY.

FT MUTAGEN 392 392 H->E: LOSS OF ACTIVITY.

FT MUTAGEN 392 392 H->I: RESIDUAL ACTIVITY.

FT MUTAGEN 559 559 H->G: INCREASE IN ACTIVITY.

SQ SEQUENCE 641 AA; 73315 MW; FCEF83BFC72A0168 CRC64;



Query Match 73.2%; Score 30; DB 1; Length 641; 

Best Local Similarity 85.7%; Pred. No. 99; 

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAED 7
              |||| ||
Db        505 LVFFQED 511



RESULT 32 
BGAL_LACDE 

ID BGAL_LACDE STANDARD; PRT; 1006 AA.

AC P20043; 

DT 01-FEB-1991 (Rel. 17, Created) 
DT 01-MAY-1991 (Rel. 18, Last sequence update) 
DT 01-JUN-1994 (Rel. 29, Last annotation update) 
DE Beta-galactosidase (EC 3.2.1.23) (Lactase). 
GN LACZ.

OS Lactobacillus delbrueckii (subsp. bulgaricus).

OC Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;

OC Lactobacillus.

OX NCBI_TaxID=1585;

RN [1]

RP SEQUENCE FROM N.A., AND SEQUENCE OF 1-29.
RC STRAIN=B131;

RX MEDLINE=89123132; PubMed=2492511;

RA Schmidt B.F., Adams R.M., Requadt C., Power S., Mainzer S.E.;

RT "Expression and nucleotide sequence of the Lactobacillus bulgaricus

RT beta-galactosidase gene cloned in Escherichia coli.";

RL J. Bacteriol. 171:625-635(1989). 

RN [2] 

RP SEQUENCE FROM N.A. 
RC STRAIN=ATCC 11842; 

RX MEDLINE=91161509; PubMed=1705929;

RA Leong-Morgenthaler P.M., Zwahlen M.C., Hottinger H.; 

RT "Lactose metabolism in Lactobacillus bulgaricus: analysis of the 

RT primary structure and expression of the genes involved."; 

RL J. Bacteriol. 173:1951-1957(1991). 

CC -!- CATALYTIC ACTIVITY: Hydrolysis of terminal, non-reducing beta-D- 
CC galactose residues in beta-D-galactosides.

CC -!- SUBUNIT: Monomer. 

CC -!- SIMILARITY: Belongs to family 2 of glycosyl hydrolases. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; M23530; AAA25240.1; -.
DR EMBL; M55068; AAA25244.1; -.
DR PIR; A30093; A30093.
DR HSSP; P00722; 1BGL. 

DR InterPro; IPR008979; Gal_bind_like.
DR InterPro; IPR006101; Glyco_hydro_2.
DR InterPro; IPR006102; Glyco_hydro_2Ig.
DR InterPro; IPR006104; Glyco_hydro_2SB.
DR InterPro; IPR006103; Glyco_hydro_2TIM.
DR InterPro; IPR004200; Glyco_hydro_42C.
DR InterPro; IPR004199; Glyco_hydro_42N.
DR Pfam; PF02930; Bgal_small_C; 1.
DR Pfam; PF02929; Bgal_small_N; 1.
DR Pfam; PF00703; Glyco_hydro_2; 1.
DR Pfam; PF02836; Glyco_hydro_2_C; 1.
DR Pfam; PF02837; Glyco_hydro_2_N; 1.
DR PRINTS; PR00132; GLHYDRLASE2.
DR PROSITE; PS00719; GLYCOSYL_HYDROL_F2_1; 1.
DR PROSITE; PS00608; GLYCOSYL_HYDROL_F2_2; 1.
KW Hydrolase; Glycosidase.
FT INIT_MET 0 0
FT ACT_SITE 464 464 PROTON DONOR (BY SIMILARITY).
FT ACT_SITE 531 531 NUCLEOPHILE (BY SIMILARITY).
FT CONFLICT 903 903 Y -> YK (IN REF. 2).
SQ SEQUENCE 1006 AA; 113915 MW; E5F81240EB64E769 CRC64;

Query Match 73.2%; Score 30; DB 1; Length 1006;
Best Local Similarity 83.3%; Pred. No. 1.6e+02;
Matches 5; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy          3 FFAEDF 8
              |:||||
Db         73 FYAEDF 78



RESULT 33 
UGG2_HUMAN 

ID UGG2_HUMAN STANDARD; PRT; 1516 AA. 

AC Q9NYU1; Q9UFC4;

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE UDP-glucose:glycoprotein glucosyltransferase 2 precursor (EC 2.4.1.-)

DE (UDP--Glc:glycoprotein glucosyltransferase 2) (UGT 2) (HUGT2).

GN UGCGL2.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Fetal liver; 

RX MEDLINE=20160491; PubMed=10694380;

RA Arnold S.M., Fessler L.I., Fessler J.H., Kaufman R.J.;

RT "Two homologues encoding human UDP-glucose:glycoprotein

RT glucosyltransferase differ in mRNA expression and enzymatic

RT activity."; 

RL Biochemistry 39:2149-2163(2000). 

RN [2] 

RP SEQUENCE OF 151-1516 FROM N.A. 

RC TISSUE=Testis; 

RA Bloecker H., Boecher M., Brandt P., Mewes H.-W., Gassenhuber J.,

RA Wiemann S.;

RL Submitted (NOV-1999) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: Selectively reglucosylates unfolded glycoproteins, thus 
CC providing quality control for protein transport out of the ER. 

CC Unfolded, denatured glycoproteins are substantially better 



CC substrates for glucosylation by this enzyme than are the 

CC corresponding native proteins. This protein and transient 

CC glucosylation may be involved in monitoring and/or assisting the 

CC folding and assembly of newly made glycoproteins, in order to 

CC identify glycoproteins that need assistance in folding from 

CC chaperones.

CC -!- COFACTOR: Requires calcium and manganese ions for activity.

CC -!- PATHWAY: Glycosylation.

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum lumen.

CC -!- TISSUE SPECIFICITY: Expressed in pancreas, heart, brain, placenta,
CC lung, liver, skeletal muscle, and kidney.

CC -!- SIMILARITY: Belongs to the glycosyltransferase family 8.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF227906; AAF66233.2; -. 

DR EMBL; AL133051; CAB61378.1; -. 

DR PIR; T42654; T42654. 

DR Genew; HGNC:15664; UGCGL2.

DR MIM; 605898; -.

DR InterPro; IPR000886; ER_target_S.

DR InterPro; IPR002495; Glyco_trans_8.

DR Pfam; PF01501; Glyco_transf_8; 1.

DR PROSITE; PS00014; ER_TARGET; 1.

KW Signal; Transferase; Glycosyltransferase; Endoplasmic reticulum;

KW Glycoprotein. 



FT SIGNAL 1 27 POTENTIAL.

FT CHAIN 28 1516 UDP-GLUCOSE:GLYCOPROTEIN

FT GLUCOSYLTRANSFERASE 2.

FT CARBOHYD 256 256 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 286 286 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 920 920 N-LINKED (GLCNAC...) (POTENTIAL).

FT CARBOHYD 950 950 N-LINKED (GLCNAC...) (POTENTIAL).

FT SITE 1513 1516 PREVENT SECRETION FROM ER (POTENTIAL).

FT CONFLICT 323 323 T -> A (IN REF. 2).

FT CONFLICT 328 328 A -> S (IN REF. 2).

FT CONFLICT 821 821 T -> A (IN REF. 2).

FT CONFLICT 994 994 L -> M (IN REF. 2).

SQ SEQUENCE 1516 AA; 174760 MW; 31A7C0AFF29D1268 CRC64;



Query Match 73.2%; Score 30; DB 1; Length 1516; 

Best Local Similarity 83.3%; Pred. No. 2.3e+02; 

Matches 5; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy          3 FFAEDF 8
              |:||||
Db        887 FYAEDF 892



RESULT 34 
CA1E_CHICK



ID CA1E_CHICK STANDARD; PRT; 1888 AA. 

AC P32018; 

DT 01-JUL-1993 (Rel. 26, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Collagen alpha 1 (XIV) chain precursor (Undulin).

GN COL14A1.

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo; 

RX MEDLINE=93185668; PubMed=8444186;

RA Waelchli C., Trueb J., Kessler B., Winterhalter K.H., Trueb B.;

RT "Complete primary structure of chicken collagen XIV."; 

RL Eur. J. Biochem. 212:483-490(1993). 

RN [2] 

RP SEQUENCE OF 402-1549 FROM N.A. 

RX MEDLINE=92339443; PubMed=1339349;

RA Trueb J., Trueb B.; 

RT "Type XIV collagen is a variant of undulin."; 

RL Eur. J. Biochem. 207:549-557(1992). 

RN [3] 

RP STRUCTURE BY NMR OF 1852-1885. 

RX MEDLINE=99280705; PubMed=10350466;

RA Montserret R., Aubert-Foucher E., McLeish M.J., Hill J.M., Ficheux D.,

RA Jaquinod M., van der Rest M., Deleage G., Penin F.;

RT "Structural analysis of the heparin-binding site of the NC1 domain of

RT collagen XIV by CD and NMR."; 

RL Biochemistry 38:6479-6488(1999). 

CC -!- FUNCTION: An adhesive role by integrating collagen bundles. It is 

CC probably associated with the surface of interstitial collagen 

CC fibrils via COL1. The COL2 domain may then serve as a rigid arm

CC which sticks out from the fibril and protrudes the large N-

CC terminal globular domain into the extracellular space, where it

CC might interact with other matrix molecules or cell surface

CC receptors.

CC -!- SUBUNIT: Homotrimer (Probable). 

CC -!- SUBCELLULAR LOCATION: Extracellular matrix. 

CC -!- TISSUE SPECIFICITY: Wide tissue distribution; high presence in 
CC dense connective tissue in skeletal muscle. 

CC -!- PTM: Lysines at the third position of the tripeptide repeating 

CC unit (G-X-Y) are hydroxylated in all cases and bind carbohydrates. 

CC -!- PTM: Prolines at the third position of the tripeptide repeating 

CC unit (G-X-Y) are hydroxylated in some or all of the chains. 

CC -!- PTM: May contain numerous cysteine residues involved in inter- and 

CC intramolecular disulfide bonding. 

CC -!- SIMILARITY: BELONGS TO THE FIBRIL-ASSOCIATED COLLAGENS WITH 
CC INTERRUPTED HELICES (FACIT) FAMILY. 

CC -!- SIMILARITY: Contains 1 TSP N-terminal (TSPN) domain. 

CC -!- SIMILARITY: Contains 2 VWFA domains. 

CC -!- SIMILARITY: Contains 6 fibronectin type III domains. 

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 






CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X70793; CAA50064.1; -. 

DR EMBL; X70792; CAA50063.1; -. 

DR EMBL; X66138; CAA46928.1; -. 

DR PIR; A45974; A45974. 

DR PIR; S78476; S78476. 

DR PDB; 1B9P; 25-FEB-99. 

DR PDB; 1B9Q; 25-FEB-99. 

DR InterPro; IPR008160; Collagen. 

DR InterPro; IPR008957; FN_III-like. 

DR InterPro; IPR003961; FN_III. 

DR InterPro; IPR003129; TSPN. 

DR InterPro; IPR002035; VWF_A. 

DR Pfam; PF01391; Collagen; 4. 

DR Pfam; PF00041; fn3; 8. 

DR Pfam; PF02210; TSPN; 1. 

DR Pfam; PF00092; vwa; 2. 

DR PRINTS; PR00453; VWFADOMAIN. 

DR SMART; SM00060; FN3; 6.

DR SMART; SM00210; TSPN; 1. 

DR SMART; SM00327; VWA; 2. 

DR PROSITE; PS50234; VWFA; 2. 

KW Extracellular matrix; Connective tissue; Repeat; Hydroxylation; 

KW Cell adhesion; Collagen; Glycoprotein; Signal; 3D-structure.



FT SIGNAL 1 28 POTENTIAL.

FT CHAIN 29 1888 COLLAGEN ALPHA 1 (XIV) CHAIN.

FT DOMAIN 158 330 VWFA 1.

FT DOMAIN 441 529 FIBRONECTIN TYPE-III 1.

FT DOMAIN 531 622 FIBRONECTIN TYPE-III 2.

FT DOMAIN 625 728 FIBRONECTIN TYPE-III 3.

FT DOMAIN 739 821 FIBRONECTIN TYPE-III 4.

FT DOMAIN 829 922 FIBRONECTIN TYPE-III 5.

FT DOMAIN 952 1010 FIBRONECTIN TYPE-III 6.

FT DOMAIN 1042 1215 VWFA 2.

FT DOMAIN 1239 1444 TSP N-TERMINAL.

FT DOMAIN 1227 1468 NONHELICAL REGION (NC4).

FT DOMAIN 1469 1620 TRIPLE-HELICAL REGION 1.

FT DOMAIN 1664 1786 TRIPLE-HELICAL REGION 2.

FT DOMAIN 721 733 POLY-THR.

FT CARBOHYD 138 138 N-LINKED (GLCNAC...) (POTENTIAL)

FT CARBOHYD 1398 1398 N-LINKED (GLCNAC...) (POTENTIAL)

FT SITE 1489 1491 CELL ATTACHMENT SITE (POTENTIAL).

FT SITE 1617 1619 CELL ATTACHMENT SITE (POTENTIAL).

FT TURN 1855 1856

FT TURN 1859 1861

FT HELIX 1862 1880

FT TURN 1881 1882

SQ SEQUENCE 1888 AA; 202666 MW; 39915BB9F46DD873 CRC64;



Query Match 73.2%; Score 30; DB 1; Length 1888;

Best Local Similarity 71.4%; Pred. No. 2.9e+02;

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              ||| :||
Db       1199 VFFVDDF 1205



RESULT 35 
Y126_PYRAB 

ID Y126_PYRAB STANDARD; PRT; 251 AA. 

AC Q9V2E8; 

DT 15-MAR-2004 (Rel. 43, Created) 

DT 15-MAR-2004 (Rel. 43, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update)

DE Hypothetical UPF0286 protein PYRAB01260. 

GN PYRAB01260 OR PAB2263. 

OS Pyrococcus abyssi. 

OC Archaea; Euryarchaeota; Thermococci; Thermococcales; Thermococcaceae;

OC Pyrococcus.

OX NCBI_TaxID=29292; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=GE5 / Orsay; 

RX MEDLINE=22511545; PubMed=12622808;

RA Cohen G.N., Barbe V., Flament D., Galperin M., Heilig R., Lecompte O.,

RA Poch O., Prieur D., Querellou J., Ripp R., Thierry J.-C.,

RA Van der Oost J., Weissenbach J., Zivanovic Y., Forterre P.; 

RT "An integrated analysis of the genome of the hyperthermophilic 

RT archaeon Pyrococcus abyssi."; 

RL Mol. Microbiol. 47:1495-1512(2003). 

CC -!- SIMILARITY: Belongs to the UPF0286 family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AJ248283; CAB49050.1; -. 

DR PIR; C75200; C75200. 

DR HAMAP; MF_00722; -; 1. 

DR InterPro; IPR002793; DUF91. 

DR Pfam; PF01939; DUF91; 1. 

DR ProDom; PD013521; DUF91; 1. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 251 AA; 28831 MW; 648BB6A7CE1E0EFC CRC64; 

Query Match 70.7%; Score 29; DB 1; Length 251; 

Best Local Similarity 62.5%; Pred. No. 64; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0; 

Qy          1 LVFFAEDF 8
              :|| |||:
Db        111 VVFLAEDY 118



RESULT 36 
NU1M_DALCH 

ID NU1M_DALCH STANDARD; PRT; 310 AA. 

AC O63623;

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE NADH-ubiquinone oxidoreductase chain 1 (EC 1.6.5.3). 

GN MTND1 OR ND1.

OS Dalbulus charlesi. 

OG Mitochondrion. 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Paraneoptera; Hemiptera; Euhemiptera; Membracoidea;

OC Cicadellidae; Deltocephalinae; Dalbulus. 

OX NCBI_TaxID=74059; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Dietrich C.H., Fitzgerald S.J., Holmes J.L., Black W.C. IV, 

RA Nault L.R.;

RT "Reassessment of Dalbulus leafhopper (Homoptera: Cicadellidae)

RT phylogeny based on mitochondrial DNA sequences."; 

RL Ann. Entomol. Soc. Am. 91:590-597(1998). 

CC -!- CATALYTIC ACTIVITY: NADH + ubiquinone = NAD(+) + ubiquinol.

CC -!- SIMILARITY: Belongs to the complex I subunit 1 family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF051289; AAC05753.1; -. 

DR InterPro; IPR001694; Resp_NADH_dhl.

DR Pfam; PF00146; NADHdh; 1.

DR PROSITE; PS00667; COMPLEX1_ND1_1; FALSE_NEG.

DR PROSITE; PS00668; COMPLEX1_ND1_2; 1.

KW Oxidoreductase; NAD; Ubiquinone; Mitochondrion; Transmembrane.

SQ SEQUENCE 310 AA; 36376 MW; FD59E413A881854B CRC64;

Query Match 70.7%; Score 29; DB 1; Length 310; 
Best Local Similarity 50.0%; Pred. No. 79; 

Matches 4; Conservative 2; Mismatches 2; Indels 0; Gaps 0; 

Qy          1 LVFFAEDF 8
              ::||  ||
Db        240 IIFFGSDF 247



RESULT 37 
Y189_RICPR 

ID Y189_RICPR STANDARD; PRT; 321 AA. 

AC Q9ZDX5; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 



DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Hypothetical protein RP189. 

GN RP189. 

OS Rickettsia prowazekii. 

OC Bacteria; Proteobacteria; Alphaproteobacteria; Rickettsiales;

OC Rickettsiaceae; Rickettsieae; Rickettsia.

OX NCBI_TaxID=782;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Madrid E; 

RX MEDLINE=99039499; PubMed=9823893;

RA Andersson S.G.E., Zomorodipour A., Andersson J.O., 

RA Sicheritz-Ponten T., Alsmark U.C.M., Podowski R.M., Naeslund A.K.,

RA Eriksson A.-S., Winkler H.H., Kurland C.G.; 

RT "The genome sequence of Rickettsia prowazekii and the origin of 

RT mitochondria."; 

RL Nature 396:133-140(1998). 

CC -!- SIMILARITY: SOME, TO A.AEOLICUS AQ_1104.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AJ235270; CAA14655.1; -. 

DR PIR; H71729; H71729. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 321 AA; 36653 MW; 3E5F47D104DD8A73 CRC64; 

Query Match 70.7%; Score 29; DB 1; Length 321; 
Best Local Similarity 71.4%; Pred. No. 82; 

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy          1 LVFFAED 7
              |:||| |
Db        179 LIFFAHD 185



RESULT 38 
HST2_YEAST 

ID HST2_YEAST STANDARD; PRT; 357 AA. 

AC P53686; 

DT 01-OCT-1996 (Rel. 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE NAD-dependent deacetylase HST2 (EC 3.5.1.-) (Homologous to SIR2 

DE protein 2).

GN HST2 OR YPL015C OR LPA2C. 

OS Saccharomyces cerevisiae (Baker's yeast). 

OC Eukaryota; Fungi; Ascomycota; Saccharomycotina; Saccharomycetes;

OC Saccharomycetales; Saccharomycetaceae; Saccharomyces.

OX NCBI_TaxID=4932;
RN [1] 

RP SEQUENCE FROM N.A. 



RC STRAIN=S288c / YPH1;

RX MEDLINE=96101589; PubMed=7498786;

RA Brachmann C.B., Sherman J.M., Devine S.E., Cameron E.E., Pillus L., 

RA Boeke J.D.;

RT "The SIR2 gene family, conserved from bacteria to humans, functions 

RT in silencing, cell cycle progression, and chromosome stability."; 

RL Genes Dev. 9:2888-2902(1995).

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=S288c / AB972; 

RX MEDLINE=97313271; PubMed=9169875;

RA Bussey H., Storms R.K., Ahmed A., Albermann K., Allen E., Ansorge W.,

RA Araujo R., Aparicio A., Barrell B.G., Badcock K., Benes V.,

RA Botstein D., Bowman S., Bruckner M., Carpenter J., Cherry J.M.,

RA Chung E., Churcher C.M., Coster F., Davis K., Davis R.W.,

RA Dietrich F.S., Delius H., DiPaolo T., Dubois E., Duesterhoeft A.,

RA Duncan M., Floeth M., Fortin N., Friesen J.D., Fritz C., Goffeau A.,

RA Hall J., Hebling U., Heumann K., Hilbert H., Hillier L.W.,

RA Hunicke-Smith S., Hyman R., Johnston M., Kalman S., Kleine K.,

RA Komp C., Kurdi O., Lashkari D., Lew H., Lin A., Lin D., Louis E.J.,

RA Marathe R., Messenguy F., Mewes H.-W., Mirtipati S., Moestl D.,

RA Mueller-Auer S., Namath A., Nentwich U., Oefner P., Pearson D.,

RA Petel F.X., Pohl T.M., Purnelle D., Schafer M., Scharfe M.,

RA Scherens B., Schramm S., Schroeder M., Sdicu A.M., Tettelin H.,

RA Urrestarazu L.A., Ushinsky S., Vierendeels F., Vissers S., Voss H.,

RA Walsh S.V., Wambutt R., Wang Y., Wedler E., Wedler H., Winnett E.,

RA Zhong W.W., Zollner A., Vo D.H., Hani J.;

RT "The nucleotide sequence of Saccharomyces cerevisiae chromosome XVI."; 

RL Nature 387:103-105(1997). 

CC -!- FUNCTION: NAD-dependent histone deacetylase, which could function 
CC in telomeric silencing, cell cycle progression and chromosome 

CC stability. 

CC -!- CATALYTIC ACTIVITY: NAD(+) + an acetylprotein = nicotinamide + O-

CC acetyl-ADP-ribose + a protein. 

CC -!- SUBCELLULAR LOCATION: Nuclear (Potential). 

CC -!- SIMILARITY: Belongs to the sirtuin family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U39063; AAA81035.1; -.

DR EMBL; U33335; AAB68090.1; -. 

DR PIR; S59678; S59678. 

DR GermOnline; 143997; -. 

DR SGD; S0005936; HST2.

DR GO; GO:0005737; C:cytoplasm; IDA.

DR GO; GO:0017136; F:NAD-dependent histone deacetylase activity; IDA.

DR GO; GO:0006348; P:chromatin silencing at telomere; IGI.

DR InterPro; IPR003000; SIR2. 

DR Pfam; PF02146; SIR2; 1. 

DR PROSITE; PS50305; SIRTUIN; 1. 

KW Hydrolase; NAD; Transcription regulation; Repressor; Nuclear protein; 



KW Metal-binding; Zinc.
FT DOMAIN 13 286 DEACETYLASE SIRTUIN-TYPE.
FT ACT_SITE 135 135 BY SIMILARITY.
FT METAL 143 143 ZINC (BY SIMILARITY).
FT METAL 146 146 ZINC (BY SIMILARITY).
FT METAL 170 170 ZINC (BY SIMILARITY).
FT METAL 173 173 ZINC (BY SIMILARITY).
SQ SEQUENCE 357 AA; 39979 MW; ED281E5B8241A4D0 CRC64;



Query Match 70.7%; Score 29; DB 1; Length 357; 

Best Local Similarity 71.4%; Pred. No. 91; 

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy          1 LVFFAED 7
              :||| ||
Db        181 IVFFGED 187



RESULT 39 
HYD2_BRAJA 

ID HYD2_BRAJA STANDARD; PRT; 380 AA.

AC P31904; 

DT 01-JUL-1993 (Rel. 26, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Hydrogenase expression/formation protein hypD2.

GN HYPD2 OR HYPD1 OR HYPD OR BLL6928.

OS Bradyrhizobium japonicum.

OC Bacteria; Proteobacteria; Alphaproteobacteria; Rhizobiales; 

OC Bradyrhizobiaceae; Bradyrhizobium. 

OX NCBI_TaxID=375; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=USDA 110; 

RX MEDLINE=22484998; PubMed=12597275;

RA Kaneko T., Nakamura Y., Sato S., Minamisawa K., Uchiumi T.,

RA Sasamoto S., Watanabe A., Idesawa K., Iriguchi M., Kawashima K.,

RA Kohara M., Matsumoto M., Shimpo S., Tsuruoka H., Wada T., Yamada M.,

RA Tabata S.; 

RT "Complete genomic sequence of nitrogen-fixing symbiotic bacterium 

RT Bradyrhizobium japonicum USDA110."; 

RL DNA Res. 9:189-197(2002). 

RN [2] 

RP SEQUENCE OF 34-380 FROM N.A. 

RC STRAIN=CB1809; 

RX MEDLINE=93287991; PubMed=8510650;

RA van Soom C., Verreth C., Sampaio M.J., Vanderleyden J.;

RT "Identification of a potential transcriptional regulator of 

RT hydrogenase activity in free-living Bradyrhizobium japonicum 

RT strains."; 

RL Mol. Gen. Genet. 239:235-240(1993). 

CC -!- SIMILARITY: Belongs to the hypD family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 






CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AP005960; BAC52193.1; -.

DR EMBL; Z17373; CAA78988.1; -. 

DR PIR; S35229; S35229. 

DR InterPro; IPR002780; HypD. 

DR Pfam; PF01924; HypD; 1. 

KW Metal-binding; Complete proteome. 



FT METAL 64 64 POTENTIAL.
FT METAL 67 67 POTENTIAL.
FT CONFLICT 54 54 K -> A (IN REF. 2).
FT CONFLICT 82 82 E -> M (IN REF. 2).
FT CONFLICT 261 261 S -> T (IN REF. 2).
SQ SEQUENCE 380 AA; 42477 MW; 2C416E80BE8C2939 CRC64;



Query Match 70.7%; Score 29; DB 1; Length 380;

Best Local Similarity 83.3%; Pred. No. 97; 

Matches 5; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy          3 FFAEDF 8
              ||||:|
Db        215 FFAEEF 220



RESULT 40 
O94B_DROME

ID O94B_DROME STANDARD; PRT; 383 AA.

AC Q9VCS8; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Putative odorant receptor 94b. 

GN OR94B OR CG6679. 

OS Drosophila melanogaster (Fruit fly).

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae; Drosophila. 

OX NCBI_TaxID=7227;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; 

RX MEDLINE=20196006; PubMed=10731132;

RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,

RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,

RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,

RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X.,

RA Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D.,

RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G.,

RA Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D.,

RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M.,

RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S.,

RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P.,

RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I.,

RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P.,

RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M.,



RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P.,
RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W.,
RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K.,
RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J.,
RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C.,
RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A.,
RA Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z.,
RA Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X.,
RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D.,
RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A.,
RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L.,
RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M.,
RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G.,
RA Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E.,
RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X.,
RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J.,
RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A.,
RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L.,
RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O.,
RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
CC -!- FUNCTION: Probable role in the odorant response, being an odorant
CC receptor.
CC -!- SUBCELLULAR LOCATION: Integral membrane protein (Potential).
CC -!- SIMILARITY: Belongs to family Dr-or of G-protein coupled
CC receptors.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AE003741; AAF56077.1; -.
DR FlyBase; FBgn0039034; Or94b.
DR InterPro; IPR004117; 7tm_6.
DR Pfam; PF02949; 7tm_6; 1.
KW Hypothetical protein; Transmembrane; G-protein coupled receptor;
KW Olfaction; Multigene family.

FT DOMAIN 1 41 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 42 62 1 (POTENTIAL).
FT DOMAIN 63 70 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 71 91 2 (POTENTIAL).
FT DOMAIN 92 130 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 131 151 3 (POTENTIAL).
FT DOMAIN 152 174 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 175 195 4 (POTENTIAL).
FT DOMAIN 196 250 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 251 271 5 (POTENTIAL).
FT DOMAIN 272 284 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 285 305 6 (POTENTIAL).
FT DOMAIN 306 358 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 359 379 7 (POTENTIAL).
FT DOMAIN 380 383 CYTOPLASMIC (POTENTIAL).
SQ SEQUENCE 383 AA; 45262 MW; 6475485085672892 CRC64;



Query Match 70.7%; Score 29; DB 1; Length 383;
Best Local Similarity 71.4%; Pred. No. 98;
Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;



Qy          2 VFFAEDF 8
              ||| ||:
Db        150 VFFQEDY 156



RESULT 41 
HYPD_RHILV 

ID HYPD_RHILV STANDARD; PRT; 385 AA. 

AC P40598; 

DT 01-FEB-1995 (Rel. 31, Created) 

DT 01-FEB-1995 (Rel. 31, Last sequence update) 

DT 01-NOV-1995 (Rel. 32, Last annotation update) 

DE Hydrogenase expression/formation protein hypD.

GN HYPD.

OS Rhizobium leguminosarum (biovar viciae).

OC Bacteria; Proteobacteria; Alphaproteobacteria; Rhizobiales;

OC Rhizobiaceae; Rhizobium/Agrobacterium group; Rhizobium.

OX NCBI_TaxID=387; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=128c53; 

RX MEDLINE=93316844; PubMed=8326860;

RA Rey L., Murillo J., Hernando Y., Hidalgo E., Cabrera E., Imperial J.,

RA Ruiz-Argueeso T.; 

RT "Molecular analysis of a microaerobically induced operon required for 

RT hydrogenase synthesis in Rhizobium leguminosarum biovar viciae."; 

RL Mol. Microbiol. 8:471-481(1993). 

CC -!- SIMILARITY: Belongs to the hypD family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X52974; CAA37163.1; -. 

DR PIR; S32877; S32877. 

DR InterPro; IPR002780; HypD. 

DR Pfam; PF01924; HypD; 1. 

DR TIGRFAMs; TIGR00075; hypD; 1. 

KW Metal-binding. 

FT METAL 64 64 POTENTIAL. 

FT METAL 67 67 POTENTIAL.

SQ SEQUENCE 385 AA; 42028 MW; 0B15216C15665B12 CRC64; 



Query Match 70.7%; Score 29; DB 1; Length 385;
Best Local Similarity 83.3%; Pred. No. 98;
Matches 5; Conservative 1; Mismatches 0; Indels 0; Gaps 0;



Qy          3 FFAEDF 8
              ||||:|
Db        215 FFAEEF 220



RESULT 42 
SYH_MYCPU

ID SYH_MYCPU STANDARD; PRT; 420 AA. 

AC Q98QM8; 

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Histidyl-tRNA synthetase (EC 6.1.1.21) (Histidine--tRNA ligase)

DE (HisRS).

GN HISS OR MYPU_3330. 

OS Mycoplasma pulmonis. 

OC Bacteria; Firmicutes; Mollicutes; Mycoplasmataceae; Mycoplasma.

OX NCBI_TaxID=2107;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=UAB CTIP; 

RX MEDLINE=21267165; PubMed=11353084;

RA Chambaud I., Heilig R., Ferris S., Barbe V., Samson D., Galisson F.,

RA Moszer I., Dybvig K., Wroblewski H., Viari A., Rocha E.P.C.,

RA Blanchard A.;

RT "The complete genome sequence of the murine respiratory pathogen 

RT Mycoplasma pulmonis."; 

RL Nucleic Acids Res. 29:2145-2153(2001). 

CC -!- CATALYTIC ACTIVITY: ATP + L-histidine + tRNA(His) = AMP +
CC diphosphate + L-histidyl-tRNA(His).

CC -!- SUBUNIT: Homodimer (By similarity). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- SIMILARITY: Belongs to class-II aminoacyl-tRNA synthetase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AL445564; CAC13506.1; -. 

DR PIR; E90553; E90553. 

DR MypuList; MYPU_3330; -.

DR HAMAP; MF_00127; -; 1. 

DR InterPro; IPR004516; HisS. 

DR InterPro; IPR002314; tRNA-synt_2b.

DR InterPro; IPR006195; tRNA_ligase_II.

DR Pfam; PF00587; tRNA-synt_2b; 1.

DR TIGRFAMs; TIGR00442; hisS; 1.

DR PROSITE; PS50862; AA_TRNA_LIGASE_II; FALSE_NEG.

KW Aminoacyl-tRNA synthetase; Protein biosynthesis; Ligase; ATP-binding; 

KW Complete proteome. 



SQ SEQUENCE 420 AA; 49308 MW; 951B094769B34915 CRC64;

Query Match 70.7%; Score 29; DB 1; Length 420;

Best Local Similarity 50.0%; Pred. No. 1.1e+02;

Matches 4; Conservative 4; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 LVFFAEDF 8
              ::|||::|
Db        141 IIFFAKNF 148



RESULT 43 




CLN3_CANFA
ID CLN3_CANFA STANDARD; PRT; 438 AA.
AC Q29611;
DT 15-JUL-1998 (Rel. 36, Created)
DT 15-JUL-1998 (Rel. 36, Last sequence update)
DT 30-MAY-2000 (Rel. 39, Last annotation update)
DE CLN3 protein (Battenin).
GN CLN3.
OS Canis familiaris (Dog).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis.
OX NCBI_TaxID=9615;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=98250317; PubMed=9590435;
RA Shibuya H., Liu P.-C., Katz M.L., Siakotos A.N., Nonneman D.J.,
RA Johnson G.S.;
RT "Coding sequence and exon/intron organization of the canine CLN3
RT (Batten disease) gene and its exclusion as the locus for
RT ceroid-lipofuscinosis in English setter dogs.";
RL J. Neurosci. Res. 52:268-275(1998).
CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Lysosomal (By
CC similarity).
CC -!- SIMILARITY: Belongs to the battenin family.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; L76281; AAB05546.1; -.
DR EMBL; L76282; AAB05547.1; -.
DR EMBL; AF033661; AAB86972.1; -.
DR EMBL; AF033656; AAB86972.1; JOINED.
DR EMBL; AF033657; AAB86972.1; JOINED.
DR EMBL; AF033658; AAB86972.1; JOINED.
DR EMBL; AF033659; AAB86972.1; JOINED.
DR EMBL; AF033660; AAB86972.1; JOINED.
DR InterPro; IPR003492; Battenin.
DR Pfam; PF02487; CLN3; 1.
DR PRINTS; PR01315; BATTENIN.
KW Transmembrane; Lysosome; Glycoprotein.
FT TRANSMEM 38 58 POTENTIAL.
FT TRANSMEM [illegible] [illegible] POTENTIAL.
FT TRANSMEM 128 148 POTENTIAL.
FT TRANSMEM 180 200 POTENTIAL.
FT TRANSMEM 212 232 POTENTIAL.
FT TRANSMEM 278 298 POTENTIAL.
FT TRANSMEM 358 378 POTENTIAL.
FT TRANSMEM 407 427 POTENTIAL.
SQ SEQUENCE 438 AA; 47933 MW; 7964662FCDB33879 CRC64;



Query Match 70.7%; Score 29; DB 1; Length 438; 

Best Local Similarity 75.0%; Pred. No. 1.1e+02;

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy          1 LVFFAEDF 8
              ||:||| |
Db        290 LVYFAEYF 297



RESULT 44 
CLN3_MOUSE 

ID CLN3_MOUSE STANDARD; PRT; 438 AA. 

AC Q61124; P70400; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 30-MAY-2000 (Rel. 39, Last annotation update) 

DE CLN3 protein (Battenin) . 

GN CLN3.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Teratocarcinoma; 

RX MEDLINE=97001171; PubMed=8812504;

RA Lee R.L., Johnson K.R., Lerner T.J.; 

RT "Isolation and chromosomal mapping of a mouse homolog of the Batten 

RT disease gene CLN3.";

RL Genomics 35:617-619(1996). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; TISSUE=Liver;

RX MEDLINE=97295723; PubMed=9151313;

RA Taschner P.E.M., de Vos N., Breuning M.H.;

RT "Cross-species homology of the CLN3 gene.";

RL Neuropediatrics 28:18-20(1997). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Lysosomal (By 
CC similarity).

CC -!- SIMILARITY: Belongs to the battenin family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 



CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U47106; AAC52957.1; -.

DR EMBL; U68064; AAB07595.1; -.

DR MGD; MGI:107537; Cln3.

DR InterPro; IPR003492; Battenin. 

DR Pfam; PF02487; CLN3; 1. 

DR PRINTS; PR01315; BATTENIN. 

KW Transmembrane; Lysosome; Glycoprotein. 



FT TRANSMEM 38 58 POTENTIAL.
FT TRANSMEM 99 119 POTENTIAL.
FT TRANSMEM 128 148 POTENTIAL.
FT TRANSMEM 187 207 POTENTIAL.
FT TRANSMEM 212 232 POTENTIAL.
FT TRANSMEM 278 298 POTENTIAL.
FT TRANSMEM 358 378 POTENTIAL.
FT TRANSMEM 407 427 POTENTIAL.
FT CONFLICT 79 79 R -> P (IN REF. 2).
FT CONFLICT 120 120 H -> Y (IN REF. 2).
FT CONFLICT 237 237 W -> L (IN REF. 2).
FT CONFLICT 317 318 HE -> QQ (IN REF. 2).
FT CONFLICT 380 380 H -> Y (IN REF. 2).
SQ SEQUENCE 438 AA; 47773 MW; 0983C604E6959A2C CRC64;


Query Match 70.7%; Score 29; DB 1; Length 438;
Best Local Similarity 75.0%; Pred. No. 1.1e+02;



Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy          1 LVFFAEDF 8
              ||:||| |
Db        290 LVYFAEYF 297



RESULT 45 
CH62_CHLPN

ID CH62_CHLPN STANDARD; PRT; 526 AA.

AC Q9Z7C9; Q9JS18; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE 60 kDa chaperonin 2 (Protein Cpn60 2) (groEL protein 2). 

GN GROL2 OR GROEL2 OR CPN0777 OR CP1095 OR CPB0805. 

OS Chlamydia pneumoniae (Chlamydophila pneumoniae).

OC Bacteria; Chlamydiae; Chlamydiales; Chlamydiaceae; Chlamydophila.

OX NCBI_TaxID=83558;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=CWL029; 

RX MEDLINE=99206606; PubMed=10192388;

RA Kalman S., Mitchell W., Marathe R., Lammel C., Fan J., Hyman R.W.,

RA Olinger L., Grimwood J., Davis R.W., Stephens R.S.; 

RT "Comparative genomes of Chlamydia pneumoniae and C. trachomatis."; 

RL Nat. Genet. 21:385-389(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=AR39; 

RX MEDLINE=20150255; PubMed=10684935;

RA Read T.D., Brunham R.C., Shen C., Gill S.R., Heidelberg J.F.,

RA White O., Hickey E.K., Peterson J., Utterback T., Berry K., Bass S.,

RA Linher K., Weidman J., Khouri H., Craven B., Bowman C., Dodson R.,

RA Gwinn M., Nelson W., DeBoy R., Kolonay J., McClarty G., Salzberg S.L.,

RA Eisen J., Fraser C.M.;

RT "Genome sequences of Chlamydia trachomatis MoPn and Chlamydia 

RT pneumoniae AR39."; 

RL Nucleic Acids Res. 28:1397-1406(2000). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=J138; 

RX MEDLINE=20330349; PubMed=10871362;

RA Shirai M., Hirakawa H., Kimoto M., Tabuchi M., Kishi F., Ouchi K.,

RA Shiba T., Ishii K., Hattori M., Kuhara S., Nakazawa T.;

RT "Comparison of whole genome sequences of Chlamydia pneumoniae J138

RT from Japan and CWL029 from USA.";

RL Nucleic Acids Res. 28:2311-2314(2000). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=TW-183;

RA Geng M.M., Schuhmacher A., Muehldorfer I., Bensch K.W., Schaefer K.P.,

RA Schneider S., Pohl T., Essig A., Marre R., Melchers K.;

RT "The genome sequence of Chlamydia pneumoniae TW183 and comparison with 

RT other Chlamydia strains based on whole genome sequence analysis."; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: Prevents misfolding and promotes the refolding and 

CC proper assembly of unfolded polypeptides generated under stress 

CC conditions (By similarity).

CC -!- SUBUNIT: Oligomer of 14 subunits composed of two stacked rings of
CC 7 subunits (By similarity).

CC -!- SUBCELLULAR LOCATION: Cytoplasmic (By similarity). 

CC -!- SIMILARITY: Belongs to the chaperonin (HSP60) family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AE001659; AAD18915.1; -. 

DR EMBL; AE002265; AAF38865.1; -. 

DR EMBL; AP002547; BAA98985.1; -. 

DR EMBL; AE017159; AAP98734.1; -. 

DR PIR; D72036; D72036. 

DR PIR; F81504; F81504. 

DR PIR; G86587; G86587. 

DR HSSP; P06139; 1GRL. 

DR TIGR; CP1095; -.

DR HAMAP; MF_00600; atypical; 1.

DR InterPro; IPR001844; Chaprnin_Cpn60.

DR InterPro; IPR002423; Cpn60/TCP-1.

DR InterPro; IPR008950; GroEL-ATPase.

DR Pfam; PF00118; cpn60_TCP1; 1.

DR PRINTS; PR00298; CHAPERONIN60.

DR PRINTS; PR00304; TCOMPLEXTCP1.



DR PROSITE; PS00296; CHAPERONINS_CPN60; FALSE_NEG.

KW Chaperone; ATP-binding; Complete proteome.

FT VARIANT 353 353 G -> R (in strains CWL029 and TW-183).

SQ SEQUENCE 526 AA; 56825 MW; 60AAEFABB7D56238 CRC64;



Query Match 70.7%; Score 29; DB 1; Length 526; 

Best Local Similarity 75.0%; Pred. No. 1.3e+02; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0;



Qy          1 LVFFAEDF 8
              ||  ||||
Db        245 LVIIAEDF 252



RESULT 46
UVRB_STRP3



ID UVRB_STRP3 STANDARD; PRT; 663 AA. 

AC Q8K743; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE UvrABC system protein B (UvrB protein) (Excinuclease ABC subunit B).
GN UVRB OR SPYM3_0996 OR SPS0862.
OS Streptococcus pyogenes (serotype M3).
OC Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;
OC Streptococcus.
OX NCBI_TaxID=198466;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=MGAS315 / Serotype M3;

RX MEDLINE=22133808; PubMed=12122206; 

RA Beres S.B., Sylva G.L., Barbian K.D., Lei B., Hoff J.S., 

RA Mammarella N.D., Liu M.-Y., Smoot J.C., Porcella S.F., Parkins L.D., 

RA Campbell D.S., Smith T.M., McCormick J.K., Leung D.Y.M., 

RA Schlievert P.M., Musser J.M.;
RT "Genome sequence of a serotype M3 strain of group A Streptococcus:
RT phage-encoded toxins, the high-virulence phenotype, and clone
RT emergence.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:10078-10083(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=SSI-1 / Serotype M3; 

RX MEDLINE=22683278; PubMed=12799345;
RA Nakagawa I., Kurokawa K., Yamashita A., Nakata M., Tomiyasu Y.,
RA Okahashi N., Kawabata S., Yamazaki K., Shiba T., Yasunaga T.,
RA Hayashi H., Hattori M., Hamada S.;

RT "Genome sequence of an M3 strain of Streptococcus pyogenes reveals a 

RT large-scale genomic rearrangement in invasive strains and new insights 

RT into phage evolution."; 

RL Genome Res. 13:1042-1055(2003). 

CC -!- FUNCTION: The UvrABC repair system catalyzes the recognition and 
CC processing of DNA lesions. A damage recognition complex composed 

CC of 2 uvrA and 2 uvrB subunits scans DNA for abnormalities. Upon 

CC binding of the uvrA(2)B(2) complex to a putative damaged site, the 

CC DNA wraps around one uvrB monomer. DNA wrap is dependent on ATP 

CC binding by uvrB and probably causes local melting of the DNA 

CC helix, facilitating insertion of uvrB beta-hairpin between the DNA 



CC strands. Then uvrB probes one DNA strand for the presence of a 

CC lesion. If a lesion is found the uvrA subunits dissociate and the 

CC uvrB-DNA preincision complex is formed. This complex is 

CC subsequently bound by uvrC and the second uvrB is released. If no 

CC lesion is found, the DNA wraps around the other uvrB subunit that 

CC will check the other strand for damage (By similarity).
CC -!- SUBUNIT: Forms a heterotetramer with uvrA during the search for
CC lesions. Interacts with uvrC in an incision complex (By
CC similarity).
CC -!- SUBCELLULAR LOCATION: Cytoplasmic (By similarity).
CC -!- DOMAIN: The beta-hairpin motif is involved in DNA binding (By
CC similarity).
CC -!- SIMILARITY: Belongs to the uvrB family.
CC -!- SIMILARITY: Contains 1 UVR domain.
CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AE014156; AAM79603.1; -.

DR EMBL; AP005143; BAC63957.1; 

DR HAMAP; MF_00204; -; 1. 

DR InterPro; IPR001410; DEAD. 

DR InterPro; IPR001650; Helicase_C. 

DR InterPro; IPR004807; UvrB. 

DR InterPro; IPR001943; UvrB/C. 

DR Pfam; PF00271; helicase_C; 1. 

DR Pfam; PF02151; UVR; 1. 

DR SMART; SM00487; DEXDc; 1.

DR SMART; SM00490; HELICc; 1. 

DR TIGRFAMs; TIGR00631; uvrb; 1. 

DR PROSITE; PS50151; UVR; 1. 

KW SOS response; Excision nuclease; DNA repair; DNA recombination; 

KW DNA excision; ATP-binding; Complete proteome. 

FT NP_BIND 44 51 ATP (POTENTIAL).

FT DOMAIN 97 120 BETA-HAIRPIN. 

FT DOMAIN 627 662 UVR. 

SQ SEQUENCE 663 AA; 75664 MW; E4DA53524A14B0ED CRC64; 

Query Match 70.7%; Score 29; DB 1; Length 663; 
Best Local Similarity 75.0%; Pred. No. 1.7e+02; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy   1 LVFFAEDF 8
       | || |||
Db 331 LDFFPEDF 338



RESULT 47 
UVRB_STRP8 

ID UVRB_STRP8 STANDARD; PRT; 663 AA. 

AC Q8P0J7; 

DT 28-FEB-2003 (Rel. 41, Created) 



DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE UvrABC system protein B (UvrB protein) (Excinuclease ABC subunit B).
GN UVRB OR SPYM18_1326.
OS Streptococcus pyogenes (serotype M18).
OC Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;
OC Streptococcus.
OX NCBI_TaxID=186103;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=MGAS8232 / Serotype M18; 

RX MEDLINE=21927593; PubMed=11917108;
RA Smoot J.C., Barbian K.D., Van Gompel J.J., Smoot L.M., Chaussee M.S.,
RA Sylva G.L., Sturdevant D.E., Ricklefs S.M., Porcella S.F.,
RA Parkins L.D., Beres S.B., Campbell D.S., Smith T.M., Zhang Q.,
RA Kapur V., Daly J.A., Veasy L.G., Musser J.M.;
RT "Genome sequence and comparative microarray analysis of serotype M18
RT group A Streptococcus strains associated with acute rheumatic fever
RT outbreaks.";
RL Proc. Natl. Acad. Sci. U.S.A. 99:4668-4673(2002).

CC -!- FUNCTION: The UvrABC repair system catalyzes the recognition and 

CC processing of DNA lesions. A damage recognition complex composed 

CC of 2 uvrA and 2 uvrB subunits scans DNA for abnormalities. Upon

CC binding of the uvrA(2)B(2) complex to a putative damaged site, the 

CC DNA wraps around one uvrB monomer. DNA wrap is dependent on ATP 

CC binding by uvrB and probably causes local melting of the DNA 

CC helix, facilitating insertion of uvrB beta-hairpin between the DNA 

CC strands. Then uvrB probes one DNA strand for the presence of a 

CC lesion. If a lesion is found the uvrA subunits dissociate and the 

CC uvrB-DNA preincision complex is formed. This complex is

CC subsequently bound by uvrC and the second uvrB is released. If no 

CC lesion is found, the DNA wraps around the other uvrB subunit that 

CC will check the other strand for damage (By similarity).
CC -!- SUBUNIT: Forms a heterotetramer with uvrA during the search for
CC lesions. Interacts with uvrC in an incision complex (By
CC similarity).
CC -!- SUBCELLULAR LOCATION: Cytoplasmic (By similarity).
CC -!- DOMAIN: The beta-hairpin motif is involved in DNA binding (By
CC similarity).
CC -!- SIMILARITY: Belongs to the uvrB family.
CC -!- SIMILARITY: Contains 1 UVR domain.
CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC

DR EMBL; AE010054; AAL97930.1; -. 

DR HAMAP; MF_00204; -; 1. 

DR InterPro; IPR001410; DEAD. 

DR InterPro; IPR001650; Helicase_C. 

DR InterPro; IPR004807; UvrB. 

DR InterPro; IPR001943; UvrB/C. 

DR Pfam; PF00271; helicase_C; 1.



DR Pfam; PF02151; UVR; 1. 

DR TIGRFAMs; TIGR00631; uvrb; 1.
DR PROSITE; PS50151; UVR; 1.
KW SOS response; Excision nuclease; DNA repair; DNA recombination;
KW DNA excision; ATP-binding; Complete proteome.
FT NP_BIND 44 51 ATP (POTENTIAL).

FT DOMAIN 97 120 BETA-HAIRPIN. 

FT DOMAIN 627 662 UVR. 

SQ SEQUENCE 663 AA; 75697 MW; D2EC5BFECE34B245 CRC64; 



Query Match 70.7%; Score 29; DB 1; Length 663; 

Best Local Similarity 75.0%; Pred. No. 1.7e+02; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy   1 LVFFAEDF 8
       | || |||
Db 331 LDFFPEDF 338



RESULT 48 
UVRB_STRPY

ID UVRB_STRPY STANDARD; PRT; 663 AA.

AC Q99ZA5; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE UvrABC system protein B (UvrB protein) (Excinuclease ABC subunit B).
GN UVRB OR SPY1314.
OS Streptococcus pyogenes.
OC Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;

OC Streptococcus. 

OX NCBI_TaxID=1314; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=SF370 / ATCC 700294 / Serotype M1;
RX MEDLINE=21192684; PubMed=11296296;
RA Ferretti J.J., McShan W.M., Ajdic D.J., Savic D.J., Savic G., Lyon K.,
RA Primeaux C., Sezate S., Suvorov A.N., Kenton S., Lai H.S., Lin S.P.,
RA Qian Y., Jia H.G., Najar F.Z., Ren Q., Zhu H., Song L., White J.,
RA Yuan X., Clifton S.W., Roe B.A., McLaughlin R.;
RT "Complete genome sequence of an M1 strain of Streptococcus pyogenes.";

RL Proc. Natl. Acad. Sci. U.S.A. 98:4658-4663(2001). 

CC -!- FUNCTION: The UvrABC repair system catalyzes the recognition and 

CC processing of DNA lesions. A damage recognition complex composed 

CC of 2 uvrA and 2 uvrB subunits scans DNA for abnormalities. Upon 

CC binding of the uvrA(2)B(2) complex to a putative damaged site, the 

CC DNA wraps around one uvrB monomer. DNA wrap is dependent on ATP 

CC binding by uvrB and probably causes local melting of the DNA 

CC helix, facilitating insertion of uvrB beta-hairpin between the DNA 

CC strands. Then uvrB probes one DNA strand for the presence of a 

CC lesion. If a lesion is found the uvrA subunits dissociate and the 

CC uvrB-DNA preincision complex is formed. This complex is

CC subsequently bound by uvrC and the second uvrB is released. If no 

CC lesion is found, the DNA wraps around the other uvrB subunit that 

CC will check the other strand for damage (By similarity).
CC -!- SUBUNIT: Forms a heterotetramer with uvrA during the search for
CC lesions. Interacts with uvrC in an incision complex (By
CC similarity).
CC -!- SUBCELLULAR LOCATION: Cytoplasmic (By similarity).
CC -!- DOMAIN: The beta-hairpin motif is involved in DNA binding (By
CC similarity).

CC -!- SIMILARITY: Belongs to the uvrB family. 

CC -!- SIMILARITY: Contains 1 UVR domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AE006570; AAK34157.1; 

DR HSSP; P56981; 1D9X. 

DR HAMAP; MF_00204; -; 1. 

DR InterPro; IPR001410; DEAD. 

DR InterPro; IPR001650; Helicase_C. 

DR InterPro; IPR004807; UvrB. 

DR InterPro; IPR001943; UvrB/C. 

DR Pfam; PF00271; helicase_C; 1. 

DR Pfam; PF02151; UVR; 1. 

DR SMART; SM00487; DEXDc; 1. 

DR SMART; SM00490; HELICc; 1.

DR TIGRFAMs; TIGR00631; uvrb; 1. 

DR PROSITE; PS50151; UVR; 1. 

KW SOS response; Excision nuclease; DNA repair; DNA recombination; 

KW DNA excision; ATP-binding; Complete proteome. 

FT NP_BIND 44 51 ATP (POTENTIAL). 

FT DOMAIN 97 120 BETA-HAIRPIN. 

FT DOMAIN 627 662 UVR. 

SQ SEQUENCE 663 AA; 75654 MW; 999CB73C0D49C53A CRC64;

Query Match 70.7%; Score 29; DB 1; Length 663; 
Best Local Similarity 75.0%; Pred. No. 1.7e+02; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy   1 LVFFAEDF 8
       | || |||
Db 331 LDFFPEDF 338



RESULT 49 
PTR3_YEAST 

ID PTR3_YEAST STANDARD; PRT; 678 AA. 

AC P43606; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 01-NOV-1995 (Rel. 32, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE PTR3 protein (SSY3 protein).
GN PTR3 OR SSY3 OR YFR029W.
OS Saccharomyces cerevisiae (Baker's yeast).
OC Eukaryota; Fungi; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Saccharomycetaceae; Saccharomyces.
OX NCBI_TaxID=4932;



RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=S288c / AB972; 

RX MEDLINE=95400292; PubMed=7670463;
RA Murakami Y., Naitou M., Hagiwara H., Shibata T., Ozawa M.,
RA Sasanuma S.-I., Sasanuma M., Tsuchiya Y., Soeda E., Yokoyama K.,
RA Yamazaki M., Tashiro H., Eki T.;
RT "Analysis of the nucleotide sequence of chromosome VI from
RT Saccharomyces cerevisiae.";
RL Nat. Genet. 10:261-268(1995).

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=S288c / AB972; 

RX MEDLINE=96287654; PubMed=8686381;
RA Eki T., Naitou M., Hagiwara H., Abe M., Ozawa M., Sasanuma S.-I.,
RA Sasanuma M., Tsuchiya Y., Shibata T., Watanabe K., Ono A.,
RA Yamazaki M.-A., Tashiro H., Hanaoka F., Murakami Y.;

RT "Fifteen open reading frames in a 30.8 kb region of the right arm of 

RT chromosome VI from Saccharomyces cerevisiae."; 

RL Yeast 12:177-190(1996). 

RN [3] 

RP IDENTIFICATION. 

RX MEDLINE=98144794; PubMed=9483800;
RA Joergensen M.U., Bruun M.B., Didion T., Kielland-Brandt M.C.;
RT "Mutations in five loci affecting GAP1-independent uptake of neutral

RT amino acids in yeast."; 

RL Yeast 14:103-114(1998). 

CC -!- FUNCTION: Regulator of the expression of the transporters BAP2,
CC GAP1 and PTR2.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; D50617; BAA09268.1; -. 

DR PIR; S56284; S56284. 

DR GermOnline; 140183; -. 

DR SGD; S0001925; PTR3.
DR GO; GO:0019897; C:extrinsic to plasma membrane; IDA.
DR GO; GO:0016597; F:amino acid binding; IGI.
DR GO; GO:0007606; P:chemosensory perception; IGI.
SQ SEQUENCE 678 AA; 76286 MW; BF2B64842D669D6B CRC64;

Query Match 70.7%; Score 29; DB 1; Length 678; 

Best Local Similarity 83.3%; Pred. No. 1.7e+02; 

Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy   3 FFAEDF 8
       || |||
Db 654 FFVEDF 659



RESULT 50
KCB2_CANFA



ID KCB2_CANFA STANDARD; PRT; 809 AA. 

AC Q95167; 

DT 15-JUL-1999 (Rel. 38, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Potassium voltage-gated channel subfamily B member 2 (Potassium 

DE channel Kv2.2) (Fragment). 

GN KCNB2.
OS Canis familiaris (Dog).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis.
OX NCBI_TaxID=9615;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98275219; PubMed=9612272;
RA Schmalz F., Kinsella J., Koh S.D., Vogalis F., Schneider A.,
RA Flynn E.R., Kenyon J.L., Horowitz B.;
RT "Molecular identification of a component of delayed rectifier current
RT in gastrointestinal smooth muscles.";
RL Am. J. Physiol. 274:G901-G911(1998).
RN [2]
RP CONCEPTUAL TRANSLATION.
RA Sundaram S.;

RL Unpublished observations (OCT-2002). 

CC -!- FUNCTION: Mediates the voltage-dependent potassium ion 

CC permeability of excitable membranes. Channels open or close in

CC response to the voltage difference across the membrane, letting K+ 

CC ions pass in accordance with their electrochemical gradient. 

CC -!- SUBUNIT: Heteromultimer with KCNS1, KCNS2 and KCNS3 (By 

CC similarity).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- DOMAIN: The segment S4 is probably the voltage-sensor and is 
CC characterized by a series of positively charged amino acids at 

CC every third position. 

CC -!- DOMAIN: The tail may be important in modulation of channel 

CC activity and/or targeting of the channel to specific subcellular 

CC compartments.
CC -!- PTM: Phosphorylated (Potential).
CC -!- SIMILARITY: Belongs to the potassium channel family. B (Shab)
CC subfamily.
CC -!- CAUTION: This is a conceptual translation, a frameshift was
CC introduced in position 768 to restore the similarity with the
CC other orthologs.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U69963; AAB08432.1; ALT_FRAME. 

DR HSSP; Q54397; 1BL8.
DR InterPro; IPR000210; BTB_POZ.
DR InterPro; IPR005821; Ion_trans.
DR InterPro; IPR001622; K+channel_pore.
DR InterPro; IPR003091; K_channel.
DR InterPro; IPR003131; K_tetra.
DR InterPro; IPR005826; Kv22channel.
DR InterPro; IPR003968; Kv_channel.
DR InterPro; IPR005820; M+channel_nlg.
DR InterPro; IPR003973; Shab_channel.
DR Pfam; PF00520; ion_trans; 1.
DR Pfam; PF02214; K_tetra; 1.
DR Pfam; PF03521; Kv2channel; 1.
DR PRINTS; PR00169; KCHANNEL.
DR PRINTS; PR01515; KV22CHANNEL.
DR PRINTS; PR01491; KVCHANNEL.

DR PRINTS; PR01495; SHABCHANNEL. 

DR SMART; SM00225; BTB; 1. 

KW Transport; Ion transport; Ionic channel; Voltage-gated channel; 

KW Potassium channel; Potassium; Potassium transport; Transmembrane; 

KW Phosphorylation; Glycoprotein; Multigene family. 



FT DOMAIN 1 190 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 191 212 SEGMENT S1 (POTENTIAL).
FT TRANSMEM 233 254 SEGMENT S2 (POTENTIAL).
FT DOMAIN 255 264 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 265 286 SEGMENT S3 (POTENTIAL).
FT TRANSMEM 299 320 SEGMENT S4 (POTENTIAL).
FT DOMAIN 321 334 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 335 356 SEGMENT S5 (POTENTIAL).
FT DOMAIN 369 389 SEGMENT H5 (PORE-FORMING) (POTENTIAL).
FT TRANSMEM 397 418 SEGMENT S6 (POTENTIAL).
FT DOMAIN 419 >809 CYTOPLASMIC (POTENTIAL).
FT CARBOHYD 287 287 N-LINKED (GLCNAC...) (POTENTIAL).
FT NON_TER 809 809
SQ SEQUENCE 809 AA; 90602 MW; E1FF56334F3DA9C8 CRC64;



Query Match 70.7%; Score 29; DB 1; Length 809; 

Best Local Similarity 100.0%; Pred. No. 2.1e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAE 6
       ||||||
Db 352 LVFFAE 357



RESULT 51 
KCB1_MOUSE

ID KCB1_MOUSE STANDARD; PRT; 857 AA.

AC Q03717; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Potassium voltage-gated channel subfamily B member 1 (Potassium 
DE channel Kv2.1) (mShab).
GN KCNB1.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090; 
RN [1] 



RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=91162315; PubMed=2002364;
RA Pak M.D., Covarrubias M., Ratcliffe A., Salkoff L.;
RT "A mouse brain homolog of the Drosophila Shab K+ channel with
RT conserved delayed-rectifier properties.";

RL J. Neurosci. 11:869-880(1991). 

CC -!- FUNCTION: Mediates the voltage-dependent potassium ion 

CC permeability of excitable membranes. Channels open or close in
CC response to the voltage difference across the membrane, letting K+
CC ions pass in accordance with their electrochemical gradient.
CC -!- SUBUNIT: Heteromultimer with KCNG2, KCNG3, KCNG4, KCNS1, KCNS2,
CC KCNS3 and KCNV2 (By similarity).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- DOMAIN: The segment S4 is probably the voltage-sensor and is 
CC characterized by a series of positively charged amino acids at 

CC every third position. 

CC -!- DOMAIN: The tail may be important in modulation of channel 

CC activity and/or targeting of the channel to specific subcellular 

CC compartments.
CC -!- PTM: Phosphorylated on serine residues in the C-terminal
CC cytoplasmic tail (By similarity).
CC -!- MISCELLANEOUS: Inhibited by tetraethylammonium chloride (TEA).

CC -!- SIMILARITY: Belongs to the potassium channel family. B (Shab) 
CC subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; M64228; AAA40112.1; -. 

DR PIR; I56529; I56529.
DR HSSP; Q54397; 1BL8.
DR MGD; MGI:96666; Kcnb1.
DR InterPro; IPR000210; BTB_POZ.
DR InterPro; IPR005821; Ion_trans.
DR InterPro; IPR001622; K+channel_pore.
DR InterPro; IPR003091; K_channel.
DR InterPro; IPR003131; K_tetra.
DR InterPro; IPR004350; Kv21channel.
DR InterPro; IPR003968; Kv_channel.
DR InterPro; IPR005820; M+channel_nlg.
DR InterPro; IPR003973; Shab_channel.
DR Pfam; PF00520; ion_trans; 1.
DR Pfam; PF02214; K_tetra; 1.
DR Pfam; PF03521; Kv2channel; 1.

DR PRINTS; PR00169; KCHANNEL. 

DR PRINTS; PR01514; KV21CHANNEL. 

DR PRINTS; PR01491; KVCHANNEL. 

DR PRINTS; PR01495; SHABCHANNEL. 

DR SMART; SM00225; BTB; 1. 

KW Transport; Ion transport; Ionic channel; Voltage-gated channel; 

KW Potassium channel; Potassium; Potassium transport; Transmembrane; 



KW Phosphorylation; Multigene family.
FT DOMAIN 1 185 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 186 206 SEGMENT S1 (POTENTIAL).
FT TRANSMEM 229 249 SEGMENT S2 (POTENTIAL).
FT DOMAIN 250 259 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 260 280 SEGMENT S3 (POTENTIAL).
FT TRANSMEM 295 315 SEGMENT S4 (POTENTIAL).
FT DOMAIN 316 330 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 331 351 SEGMENT S5 (POTENTIAL).
FT DOMAIN 365 385 SEGMENT H5 (PORE-FORMING) (POTENTIAL).
FT TRANSMEM 393 413 SEGMENT S6 (POTENTIAL).
FT DOMAIN 414 857 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 517 520 POLY-SER.
SQ SEQUENCE 857 AA; 95692 MW; 772F1C42C25EF3A5 CRC64;



Query Match 70.7%; Score 29; DB 1; Length 857; 

Best Local Similarity 100.0%; Pred. No. 2.2e+02; 
Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAE 6
       ||||||
Db 348 LVFFAE 353



RESULT 52 
KCB1_RAT 

ID KCB1_RAT STANDARD; PRT; 857 AA. 

AC P15387; 

DT 01-APR-1990 (Rel. 14, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Potassium voltage-gated channel subfamily B member 1 (Potassium 

DE channel Kv2.1) (DRK1).
GN KCNB1.
OS Rattus norvegicus (Rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE OF 4-857 FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=89365157; PubMed=2770868;
RA Frech G.C., Vandongen A.M.J., Schuster G., Brown A.M., Joho R.H.;
RT "A novel potassium channel with delayed rectifier properties isolated
RT from rat brain by expression cloning.";
RL Nature 340:642-645(1989).
RN [2]
RP REVISIONS.
RA Frech G.C.;

RL Submitted (FEB-1990) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE OF 1-575 FROM N.A. 

RX MEDLINE=92156897; PubMed=1740690; 

RA Drewe J.A., Verma S., Frech G.C., Joho R.H.;
RT "Distinct spatial and temporal expression patterns of K+ channel mRNAs
RT from different subfamilies.";
RL J. Neurosci. 12:538-548(1992).



RN [4] 

RP PHOSPHORYLATION, AND MUTAGENESIS OF SER-444 AND SER-496. 

RX MEDLINE=98016256; PubMed=9351973;
RA Murakoshi H., Shi G., Scannevin R.H., Trimmer J.S.;
RT "Phosphorylation of the Kv2.1 K+ channel alters voltage-dependent

RT activation."; 

RL Mol. Pharmacol. 52:821-828(1997). 

CC -!- FUNCTION: Mediates the voltage-dependent potassium ion 

CC permeability of excitable membranes. Channels open or close in 

CC response to the voltage difference across the membrane, letting K+ 

CC ions pass in accordance with their electrochemical gradient. 

CC -!- SUBUNIT: Heteromultimer with KCNG2, KCNG3, KCNG4, KCNS1, KCNS2,
CC KCNS3 and KCNV2 (By similarity).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: In the brain, the greatest density occurs in 
CC the cerebral cortex, followed by the hippocampus, cerebellum, and 

CC olfactory bulb. In peripheral tissues it is most prominent in 

CC retina and kidney. Also present in cardiac muscle tissue of the 

CC atrium and ventricle and in skeletal muscle. 

CC -!- DOMAIN: The segment S4 is probably the voltage-sensor and is 
CC characterized by a series of positively charged amino acids at 

CC every third position. 

CC -!- DOMAIN: The tail may be important in modulation of channel 

CC activity and/or targeting of the channel to specific subcellular 

CC compartments.

CC -!- PTM: Phosphorylated on serine residues in the C-terminal 
CC cytoplasmic tail, essentially between residues 667 and 853. 

CC -!- SIMILARITY: Belongs to the potassium channel family. B (Shab) 
CC subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC

DR EMBL; X16476; CAA34497.1; ALT_INIT. 

DR PIR; S05448; CHRTD1.
DR HSSP; Q54397; 1BL8.
DR InterPro; IPR000210; BTB_POZ.
DR InterPro; IPR005821; Ion_trans.
DR InterPro; IPR001622; K+channel_pore.
DR InterPro; IPR003091; K_channel.
DR InterPro; IPR003131; K_tetra.
DR InterPro; IPR004350; Kv21channel.
DR InterPro; IPR003968; Kv_channel.
DR InterPro; IPR005820; M+channel_nlg.
DR InterPro; IPR003973; Shab_channel.
DR Pfam; PF00520; ion_trans; 1.
DR Pfam; PF02214; K_tetra; 1.
DR Pfam; PF03521; Kv2channel; 1.
DR PRINTS; PR00169; KCHANNEL.
DR PRINTS; PR01514; KV21CHANNEL.
DR PRINTS; PR01491; KVCHANNEL.

DR PRINTS; PR01495; SHABCHANNEL. 



DR SMART; SM00225; BTB; 1.
KW Transport; Ion transport; Ionic channel; Voltage-gated channel;
KW Potassium channel; Potassium; Potassium transport; Transmembrane;
KW Phosphorylation; Multigene family.
FT DOMAIN 1 186 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 187 208 SEGMENT S1 (POTENTIAL).
FT TRANSMEM 229 250 SEGMENT S2 (POTENTIAL).
FT DOMAIN 251 260 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 261 282 SEGMENT S3 (POTENTIAL).
FT TRANSMEM 295 316 SEGMENT S4 (POTENTIAL).
FT DOMAIN 317 330 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 331 [illegible] SEGMENT S5 (POTENTIAL).
FT DOMAIN 365 385 SEGMENT H5 (PORE-FORMING) (POTENTIAL).
FT TRANSMEM 393 414 SEGMENT S6 (POTENTIAL).
FT DOMAIN 415 857 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 517 520 POLY-SER.
FT DOMAIN 700 705 POLY-ALA.
FT MUTAGEN 444 444 S->A: NO EFFECT ON PHOSPHORYLATION.
FT MUTAGEN 496 496 S->A: NO EFFECT ON PHOSPHORYLATION.
SQ SEQUENCE 857 AA; 95637 MW; B3C5B0839AB15FD0 CRC64;


Query Match 70.7%; Score 29; DB 1; Length 857;
Best Local Similarity 100.0%; Pred. No. 2.2e+02;
Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAE 6
       ||||||
Db 348 LVFFAE 353



RESULT 53
KCB1_HUMAN

ID KCB1_HUMAN STANDARD; PRT; 858 AA.
AC Q14721; Q14193;
DT 15-JUL-1998 (Rel. 36, Created)
DT 28-FEB-2003 (Rel. 41, Last sequence update)
DT 28-FEB-2003 (Rel. 41, Last annotation update)
DE Potassium voltage-gated channel subfamily B member 1 (Potassium
DE channel Kv2.1) (h-DRK1).
GN KCNB1.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=94363205; PubMed=8081723;
RA Albrecht B., Lorra C., Stocker K., Pongs O.;
RT "Cloning and characterization of a human delayed rectifier potassium
RT channel gene.";
RL Recept. Channels 1:99-110(1993).
RN [2]
RP SEQUENCE FROM N.A.
RA Ikeda S.R., Soler F., Zuhlke R.D., Joho R.H., Lewis D.L.;
RL Submitted (JAN-1993) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP SEQUENCE FROM N.A.





RC TISSUE=Lens epithelium; 

RA Rae J.L., Shepard A.R.; 

RT "Identification of potassium channels in human lens epithelium."; 

RL (In) Civan M.M. (eds.); 

RL The eye's aqueous humor-from secretion to glaucoma, pp. 69-104, 

RL Academic Press, San Diego (1998). 

RN [4] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21638749; PubMed=11780052;
RA Deloukas P., Matthews L.H., Ashurst J., Burton J., Gilbert J.G.R.,
RA Jones M., Stavrides G., Almeida J.P., Babbage A.K., Bagguley C.L.,
RA Bailey J., Barlow K.F., Bates K.N., Beard L.M., Beare D.M.,
RA Beasley O.P., Bird C.P., Blakey S.E., Bridgeman A.M., Brown A.J.,
RA Buck D., Burrill W., Butler A.P., Carder C., Carter N.P.,
RA Chapman J.C., Clamp M., Clark G., Clark L.N., Clark S.Y., Clee C.M.,
RA Clegg S., Cobley V.E., Collier R.E., Connor R., Corby N.R.,
RA Coulson A., Coville G.J., Deadman R., Dhami P., Dunn M.,
RA Ellington A.G., Frankland J.A., Fraser A., French L., Garner P.,
RA Grafham D.V., Griffiths C., Griffiths M.N.D., Gwilliam R., Hall R.E.,
RA Hammond S., Harley J.L., Heath P.D., Ho S., Holden J.L., Howden P.J.,
RA Huckle E., Hunt A.R., Hunt S.E., Jekosch K., Johnson C.M., Johnson D.,
RA Kay M.P., Kimberley A.M., King A., Knights A., Laird G.K., Lawlor S.,
RA Lehvaslaiho M.H., Leversha M., Lloyd C., Lloyd D.M., Lovell J.D.,
RA Marsh V.L., Martin S.L., McConnachie L.J., McLay K., McMurray A.A.,
RA Milne S., Mistry D., Moore M.J.F., Mullikin J.C., Nickerson T.,
RA Oliver K., Parker A., Patel R., Pearce T.A.V., Peck A.I.,
RA Phillimore B.J.C.T., Prathalingam S.R., Plumb R.W., Ramsay H.,
RA Rice C.M., Ross M.T., Scott C.E., Sehra H.K., Shownkeen R., Sims S.,
RA Skuce C.D., Smith M.L., Soderlund C., Steward C.A., Sulston J.E.,
RA Swann M., Sycamore N., Taylor R., Tee L., Thomas D.W., Thorpe A.,
RA Tracey A., Tromans A.C., Vaudin M., Wall M., Wallis J.M.,
RA Whitehead S.L., Whittaker P., Willey D.L., Williams L., Williams S.A.,
RA Wilming L., Wray P.W., Hubbard T., Durbin R.M., Bentley D.R., Beck S.,
RA Rogers J.;

RT "The DNA sequence and comparative analysis of human chromosome 20."; 

RL Nature 414:865-871(2001). 

CC -!- FUNCTION: Mediates the voltage-dependent potassium ion 

CC permeability of excitable membranes. Channels open or close in 

CC response to the voltage difference across the membrane, letting K+ 

CC ions pass in accordance with their electrochemical gradient. 

CC -!- SUBUNIT: Heteromultimer with KCNG2, KCNG3, KCNG4, KCNS1, KCNS2,
CC KCNS3 and KCNV2 (By similarity).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- DOMAIN: The segment S4 is probably the voltage-sensor and is 
CC characterized by a series of positively charged amino acids at 

CC every third position. 

CC -!- DOMAIN: The tail may be important in modulation of channel
CC activity and/or targeting of the channel to specific subcellular
CC compartments.

CC -!- PTM: Phosphorylated on serine residues in the C-terminal 
CC cytoplasmic tail (By similarity).

CC -!- SIMILARITY: Belongs to the potassium channel family. B (Shab) 
CC subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; X68302; CAA48374.1; -.
DR EMBL; L02840; AAA36156.1; ALT_INIT.
DR EMBL; AF026005; AAB88808.1; -.
DR EMBL; AL035685; CAB89417.1; -.
DR PIR; S31761; S31761.
DR HSSP; Q54397; 1BL8.
DR Genew; HGNC:6231; KCNB1.
DR MIM; 600397; -.
DR InterPro; IPR000210; BTB_POZ.
DR InterPro; IPR005821; Ion_trans.
DR InterPro; IPR001622; K+channel_pore.
DR InterPro; IPR003091; K_channel.
DR InterPro; IPR003131; K_tetra.
DR InterPro; IPR004350; Kv21channel.
DR InterPro; IPR003968; Kv_channel.
DR InterPro; IPR005820; M+channel_nlg.
DR InterPro; IPR003973; Shab_channel.
DR Pfam; PF00520; ion_trans; 1.
DR Pfam; PF02214; K_tetra; 1.
DR Pfam; PF03521; Kv2channel; 1.
DR PRINTS; PR00169; KCHANNEL.
DR PRINTS; PR01514; KV21CHANNEL.
DR PRINTS; PR01491; KVCHANNEL.
DR PRINTS; PR01495; SHABCHANNEL.
DR SMART; SM00225; BTB; 1.




KW Transport; Ion transport; Ionic channel; Voltage-gated channel;
KW Potassium channel; Potassium; Potassium transport; Transmembrane;
KW Phosphorylation; Multigene family.




FT DOMAIN      1  186 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  187  208 SEGMENT S1 (POTENTIAL).
FT TRANSMEM  229  250 SEGMENT S2 (POTENTIAL).
FT DOMAIN    251  260 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  261  282 SEGMENT S3 (POTENTIAL).
FT TRANSMEM  295  316 SEGMENT S4 (POTENTIAL).
FT DOMAIN    317  330 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  331  352 SEGMENT S5 (POTENTIAL).
FT DOMAIN    365  385 SEGMENT H5 (PORE-FORMING) (POTENTIAL).
FT TRANSMEM  393  414 SEGMENT S6 (POTENTIAL).
FT DOMAIN    415  858 CYTOPLASMIC (POTENTIAL).
FT DOMAIN    517  520 POLY-SER.
FT DOMAIN    701  706 POLY-ALA.




SQ SEQUENCE 858 AA; 95877 MW; C4B426174ED0DEE4 CRC64;





Query Match 70.7%; Score 29; DB 1; Length 858; 

Best Local Similarity 100.0%; Pred. No. 2.2e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   1 LVFFAE 6
       ||||||
Db 348 LVFFAE 353



RESULT 54 
KCB1_PIG



ID KCB1_PIG STANDARD; PRT; 858 AA. 

AC O18868;

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Potassium voltage-gated channel subfamily B member 1 (Potassium 

DE channel Kv2.1) (DRK1).
GN KCNB1.
OS Sus scrofa (Pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus. 

OX NCBI_TaxID=9823; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lens epithelium; 

RA Rae J.L., Shepard A.R.; 

RL Submitted (SEP-1997) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: Mediates the voltage-dependent potassium ion 

CC permeability of excitable membranes. Channels open or close in 

CC response to the voltage difference across the membrane, letting K+ 

CC ions pass in accordance with their electrochemical gradient (By 

CC similarity).
CC -!- SUBUNIT: Heteromultimer with KCNG2, KCNG3, KCNG4, KCNS1, KCNS2,
CC KCNS3 and KCNV2 (By similarity).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- DOMAIN: The segment S4 is probably the voltage-sensor and is 
CC characterized by a series of positively charged amino acids at 

CC every third position. 

CC -!- DOMAIN: The tail may be important in modulation of channel 

CC activity and/or targeting of the channel to specific subcellular 

CC compartments.
CC -!- PTM: Phosphorylated on serine residues in the C-terminal
CC cytoplasmic tail (By similarity).

CC -!- SIMILARITY: Belongs to the potassium channel family. B (Shab) 
CC subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF026006; AAB88809.1; -. 

DR HSSP; Q54397; 1BL8.
DR InterPro; IPR000210; BTB_POZ.
DR InterPro; IPR005821; Ion_trans.
DR InterPro; IPR001622; K+channel_pore.
DR InterPro; IPR003091; K_channel.
DR InterPro; IPR003131; K_tetra.
DR InterPro; IPR004350; Kv21channel.
DR InterPro; IPR003968; Kv_channel.
DR InterPro; IPR005820; M+channel_nlg.
DR InterPro; IPR003973; Shab_channel.



DR Pfam; PF00520; ion_trans; 1.
DR Pfam; PF02214; K_tetra; 1.
DR Pfam; PF03521; Kv2channel; 1.
DR PRINTS; PR00169; KCHANNEL.
DR PRINTS; PR01514; KV21CHANNEL.
DR PRINTS; PR01491; KVCHANNEL.
DR PRINTS; PR01495; SHABCHANNEL.
DR SMART; SM00225; BTB; 1.








KW Transport; Ion transport; Ionic channel; Voltage-gated channel;
KW Potassium channel; Potassium; Potassium transport; Transmembrane;
KW Phosphorylation; Multigene family.






FT DOMAIN      1  185 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  186  206 SEGMENT S1 (POTENTIAL).
FT TRANSMEM  229  249 SEGMENT S2 (POTENTIAL).
FT DOMAIN    250  259 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  260  280 SEGMENT S3 (POTENTIAL).
FT TRANSMEM  295  315 SEGMENT S4 (POTENTIAL).
FT DOMAIN    316  330 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  331  351 SEGMENT S5 (POTENTIAL).
FT DOMAIN    365  385 SEGMENT H5 (PORE-FORMING) (POTENTIAL).
FT TRANSMEM  393  413 SEGMENT S6 (POTENTIAL).
FT DOMAIN    414  858 CYTOPLASMIC (POTENTIAL).
FT DOMAIN    517  520 POLY-SER.
FT DOMAIN    701  706 POLY-ALA.






SQ SEQUENCE 858 AA; 96117 MW; A9E24C3A8E13B491 CRC64;




Query Match 70.7%; Score 29; DB 1; Length 858;

Best Local Similarity 100.0%; Pred. No. 2.2e+02;

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAE 6
       ||||||
Db 348 LVFFAE 353



RESULT 55 
KCB1_RABIT 

ID KCB1_RABIT STANDARD; PRT; 858 AA. 

AC Q9MZ19; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Potassium voltage-gated channel subfamily B member 1 (Potassium 

DE channel Kv2.1).
GN KCNB1.
OS Oryctolagus cuniculus (Rabbit).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus.
OX NCBI_TaxID=9986;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=New Zealand white; TISSUE=Corneal endothelium; 

RA Rae J.L.; 

RL Submitted (MAY-2000) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: Mediates the voltage-dependent potassium ion 

CC permeability of excitable membranes. Channels open or close in
CC response to the voltage difference across the membrane, letting K+
CC ions pass in accordance with their electrochemical gradient (By
CC similarity).
CC -!- SUBUNIT: Heteromultimer with KCNG2, KCNG3, KCNG4, KCNS1, KCNS2,
CC KCNS3 and KCNV2 (By similarity).
CC -!- SUBCELLULAR LOCATION: Integral membrane protein.
CC -!- DOMAIN: The segment S4 is probably the voltage-sensor and is
CC characterized by a series of positively charged amino acids at
CC every third position.
CC -!- DOMAIN: The tail may be important in modulation of channel
CC activity and/or targeting of the channel to specific subcellular
CC compartments.
CC -!- PTM: Phosphorylated on serine residues in the C-terminal
CC cytoplasmic tail (By similarity).
CC -!- SIMILARITY: Belongs to the potassium channel family. B (Shab)
CC subfamily.
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
DR EMBL; AF266507; AAF77058.1; -.
DR HSSP; Q54397; 1BL8.
DR InterPro; IPR000210; BTB_POZ.
DR InterPro; IPR005821; Ion_trans.
DR InterPro; IPR001622; K+channel_pore.
DR InterPro; IPR003091; K_channel.
DR InterPro; IPR003131; K_tetra.
DR InterPro; IPR004350; Kv21channel.
DR InterPro; IPR003968; Kv_channel.
DR InterPro; IPR005820; M+channel_nlg.
DR InterPro; IPR003973; Shab_channel.
DR Pfam; PF00520; ion_trans; 1.
DR Pfam; PF02214; K_tetra; 1.
DR Pfam; PF03521; Kv2channel; 1.
DR PRINTS; PR00169; KCHANNEL.
DR PRINTS; PR01514; KV21CHANNEL.
DR PRINTS; PR01491; KVCHANNEL.
DR PRINTS; PR01495; SHABCHANNEL.
DR SMART; SM00225; BTB; 1.
KW Transport; Ion transport; Ionic channel; Voltage-gated channel;
KW Potassium channel; Potassium; Potassium transport; Transmembrane;
KW Phosphorylation; Multigene family.


FT DOMAIN      1  185 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  186  206 SEGMENT S1 (POTENTIAL).
FT TRANSMEM  229  249 SEGMENT S2 (POTENTIAL).
FT DOMAIN    250  259 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  260  280 SEGMENT S3 (POTENTIAL).
FT TRANSMEM  295  315 SEGMENT S4 (POTENTIAL).
FT DOMAIN    316  330 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  331  351 SEGMENT S5 (POTENTIAL).
FT DOMAIN    365  385 SEGMENT H5 (PORE-FORMING) (POTENTIAL).
FT TRANSMEM  393  413 SEGMENT S6 (POTENTIAL).
FT DOMAIN    414  858 CYTOPLASMIC (POTENTIAL).
FT DOMAIN    517  520 POLY-SER.
FT DOMAIN    701  706 POLY-ALA.
SQ SEQUENCE 858 AA; 95373 MW; FDFDCC49B6973C4B CRC64;



Query Match 70.7%; Score 29; DB 1; Length 858;

Best Local Similarity 100.0%; Pred. No. 2.2e+02;

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAE 6
       ||||||
Db 348 LVFFAE 353



RESULT 56 
KCB2_RAT



ID KCB2_RAT STANDARD; PRT; 907 AA. 

AC Q63099; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Potassium voltage-gated channel subfamily B member 2 (Potassium 

DE channel Kv2.2) (CDRK).
GN KCNB2.
OS Rattus norvegicus (Rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain;
RX MEDLINE=92198655; PubMed=1550672;
RA Hwang P.M., Glatt C.E., Bredt D.S., Yellen G., Snyder S.H.;

RT "A novel K+ channel with unique localizations in mammalian brain: 

RT molecular cloning and characterization."; 

RL Neuron 8:473-481(1992). 

RN [2] 

RP CONCEPTUAL TRANSLATION. 

RA Sundaram S.;
RL Unpublished observations (OCT-2002).

CC -!- FUNCTION: Mediates the voltage-dependent potassium ion 

CC permeability of excitable membranes. Channels open or close in 

CC response to the voltage difference across the membrane, letting K+

CC ions pass in accordance with their electrochemical gradient. 

CC -!- SUBUNIT: Heteromultimer with KCNS1, KCNS2 and KCNS3 (By
CC similarity).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: In the brain, the greatest density occurs in 

CC the olfactory bulb, followed by the cerebral cortex, hippocampus 

CC and cerebellum. In peripheral tissues it is most prominent in 

CC whole tongue epithelium and circumvallate papillae. 

CC -!- DOMAIN: The segment S4 is probably the voltage-sensor and is 

CC characterized by a series of positively charged amino acids at 

CC every third position. 

CC -!- DOMAIN: The tail may be important in modulation of channel 

CC activity and/or targeting of the channel to specific subcellular 

CC compartments.

CC -!- PTM: Phosphorylated (Potential).
CC -!- SIMILARITY: Belongs to the potassium channel family. B (Shab)
CC subfamily.
CC -!- CAUTION: This is a conceptual translation, a frameshift was
CC introduced in position 755 to restore the similarity with the
CC other orthologs.
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
DR EMBL; M77482; AAA40905.1; ALT_FRAME.
DR HSSP; Q54397; 1BL8.
DR InterPro; IPR000210; BTB_POZ.
DR InterPro; IPR005821; Ion_trans.
DR InterPro; IPR001622; K+channel_pore.
DR InterPro; IPR003091; K_channel.
DR InterPro; IPR003131; K_tetra.
DR InterPro; IPR005826; Kv22channel.
DR InterPro; IPR003968; Kv_channel.
DR InterPro; IPR005820; M+channel_nlg.
DR InterPro; IPR003973; Shab_channel.
DR Pfam; PF00520; ion_trans; 1.
DR Pfam; PF02214; K_tetra; 1.
DR Pfam; PF03521; Kv2channel; 1.
DR PRINTS; PR00169; KCHANNEL.
DR PRINTS; PR01515; KV22CHANNEL.
DR PRINTS; PR01491; KVCHANNEL.
DR PRINTS; PR01495; SHABCHANNEL.
DR SMART; SM00225; BTB; 1.
KW Transport; Ion transport; Ionic channel; Voltage-gated channel;
KW Potassium channel; Potassium; Potassium transport; Transmembrane;
KW Phosphorylation; Glycoprotein; Multigene family.
FT DOMAIN             CYTOPLASMIC (POTENTIAL).
FT TRANSMEM       212 SEGMENT S1 (POTENTIAL).
FT TRANSMEM       254 SEGMENT S2 (POTENTIAL).
FT DOMAIN    255  264 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  265  286 SEGMENT S3 (POTENTIAL).
FT TRANSMEM  299  320 SEGMENT S4 (POTENTIAL).
FT DOMAIN    321  334 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  335  356 SEGMENT S5 (POTENTIAL).
FT DOMAIN    369  389 SEGMENT H5 (PORE-FORMING) (POTENTIAL).
FT TRANSMEM  397  418 SEGMENT S6 (POTENTIAL).
FT DOMAIN    419  907 CYTOPLASMIC (POTENTIAL).
FT CARBOHYD  287  287 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 907 AA; 102095 MW; B242D9A6753A1295 CRC64;



Query Match 70.7%; Score 29; DB 1; Length 907;

Best Local Similarity 100.0%; Pred. No. 2.3e+02;

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAE 6
       ||||||
Db 352 LVFFAE 357



RESULT 57 
KCB2_HUMAN



ID KCB2_HUMAN STANDARD; PRT; 911 AA. 

AC Q92953; Q9BXD3; 

DT 15-DEC-1998 (Rel. 37, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Potassium voltage-gated channel subfamily B member 2 (Potassium 

DE channel Kv2.2).
GN KCNB2.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;

RN [1] 

RP SEQUENCE OF 1-811 FROM N.A. 

RX MEDLINE=98275219; PubMed=9612272;
RA Schmalz F., Kinsella J., Koh S.D., Vogalis F., Schneider A.,

RA Flynn E.R., Kenyon J.L., Horowitz B.; 

RT "Molecular identification of a component of delayed rectifier current 

RT in gastrointestinal smooth muscles."; 

RL Am. J. Physiol. 274:G901-G911(1998).

RN [2] 

RP SEQUENCE FROM N.A. 

RA Rae J.L.; 

RL Submitted (JAN-2001) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: Mediates the voltage-dependent potassium ion 

CC permeability of excitable membranes. Channels open or close in 

CC response to the voltage difference across the membrane, letting K+ 

CC ions pass in accordance with their electrochemical gradient. 

CC -!- SUBUNIT: Heteromultimer with KCNS1, KCNS2 and KCNS3 (By 

CC similarity).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- DOMAIN: The segment S4 is probably the voltage-sensor and is 
CC characterized by a series of positively charged amino acids at 

CC every third position. 

CC -!- DOMAIN: The tail may be important in modulation of channel 

CC activity and/or targeting of the channel to specific subcellular 

CC compartments.

CC -!- PTM: Phosphorylated (Potential). 

CC -!- SIMILARITY: Belongs to the potassium channel family. B (Shab) 
CC subfamily. 

CC -!- CAUTION: Ref.1 sequence differs from that shown due to a
CC frameshift in position 768. 

CC - 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; U69962; AAB08433.1; ALT_FRAME. 

DR EMBL; AF338730; AAK16585.1; -. 



DR HSSP; Q54397; 1BL8.
DR Genew; HGNC:6232; KCNB2.
DR MIM; 607738; -.
DR GO; GO:0008076; C:voltage-gated potassium channel complex; TAS.
DR GO; GO:0005251; F:delayed rectifier potassium channel activity; TAS.
DR GO; GO:0006813; P:potassium ion transport; TAS.
DR GO; GO:0006940; P:regulation of smooth muscle contraction; TAS.
DR InterPro; IPR000210; BTB_POZ.
DR InterPro; IPR005821; Ion_trans.
DR InterPro; IPR001622; K+channel_pore.
DR InterPro; IPR003091; K_channel.
DR InterPro; IPR003131; K_tetra.
DR InterPro; IPR005826; Kv22channel.
DR InterPro; IPR003968; Kv_channel.
DR InterPro; IPR005820; M+channel_nlg.
DR InterPro; IPR003973; Shab_channel.
DR Pfam; PF00520; ion_trans; 1.
DR Pfam; PF02214; K_tetra; 1.
DR Pfam; PF03521; Kv2channel; 1.
DR PRINTS; PR00169; KCHANNEL.
DR PRINTS; PR01515; KV22CHANNEL.
DR PRINTS; PR01491; KVCHANNEL.
DR PRINTS; PR01495; SHABCHANNEL.
DR SMART; SM00225; BTB; 1.
KW Transport; Ion transport; Ionic channel; Voltage-gated channel;
KW Potassium channel; Potassium; Potassium transport; Transmembrane;
KW Phosphorylation; Glycoprotein; Multigene family.



FT DOMAIN      1  190 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  191  212 SEGMENT S1 (POTENTIAL).
FT TRANSMEM  233  254 SEGMENT S2 (POTENTIAL).
FT DOMAIN    255  264 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  265  286 SEGMENT S3 (POTENTIAL).
FT TRANSMEM  299  320 SEGMENT S4 (POTENTIAL).
FT DOMAIN    321  334 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  335  356 SEGMENT S5 (POTENTIAL).
FT DOMAIN    369  389 SEGMENT H5 (PORE-FORMING) (POTENTIAL).
FT TRANSMEM  397  418 SEGMENT S6 (POTENTIAL).
FT DOMAIN    419  911 CYTOPLASMIC (POTENTIAL).
FT CARBOHYD  287  287 N-LINKED (GLCNAC...) (POTENTIAL).
SQ SEQUENCE 911 AA; 102562 MW; 2261D7D6280CF81A CRC64;



Query Match 70.7%; Score 29; DB 1; Length 911;

Best Local Similarity 100.0%; Pred. No. 2.3e+02;

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAE 6
       ||||||
Db 352 LVFFAE 357



RESULT 58 
KCB2_RABIT 

ID KCB2_RABIT STANDARD; PRT; 911 AA. 

AC Q95L11; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 



DE Potassium voltage-gated channel subfamily B member 2 (Potassium 

DE channel Kv2.2).
GN KCNB2.
OS Oryctolagus cuniculus (Rabbit).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

OX NCBI_TaxID=9986; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22286512; PubMed=12399537;
RA Malysz J., Farrugia G., Ou Y., Szurszewski J.H., Nehra A.,
RA Gibbons S.J.;

RT "The Kv2.2 alpha subunit contributes to delayed rectifier K(+) 

RT currents in myocytes from rabbit corpus cavernosum."; 

RL J. Androl. 23:899-910(2002).

CC -!- FUNCTION: Mediates the voltage-dependent potassium ion 

CC permeability of excitable membranes. Channels open or close in 

CC response to the voltage difference across the membrane, letting K+ 

CC ions pass in accordance with their electrochemical gradient (By 

CC similarity).
CC -!- SUBUNIT: Heteromultimer with KCNS1, KCNS2 and KCNS3 (By
CC similarity).
CC -!- SUBCELLULAR LOCATION: Integral membrane protein.
CC -!- DOMAIN: The segment S4 is probably the voltage-sensor and is

CC characterized by a series of positively charged amino acids at 

CC every third position. 

CC -!- DOMAIN: The tail may be important in modulation of channel 

CC activity and/or targeting of the channel to specific subcellular 

CC compartments.

CC -!- PTM: Phosphorylated (Potential). 

CC -!- SIMILARITY: Belongs to the potassium channel family. B (Shab) 
CC subfamily. 

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AY037947; AAK84954.1; -. 

DR InterPro; IPR000210; BTB_POZ.
DR InterPro; IPR005821; Ion_trans.
DR InterPro; IPR001622; K+channel_pore.
DR InterPro; IPR003091; K_channel.
DR InterPro; IPR003131; K_tetra.
DR InterPro; IPR005826; Kv22channel.
DR InterPro; IPR003968; Kv_channel.
DR InterPro; IPR005820; M+channel_nlg.
DR InterPro; IPR003973; Shab_channel.
DR Pfam; PF00520; ion_trans; 1.
DR Pfam; PF02214; K_tetra; 1.
DR Pfam; PF03521; Kv2channel; 1.
DR PRINTS; PR00169; KCHANNEL.
DR PRINTS; PR01515; KV22CHANNEL.
DR PRINTS; PR01491; KVCHANNEL.
DR PRINTS; PR01495; SHABCHANNEL.
DR SMART; SM00225; BTB; 1.






KW Transport; Ion transport; Ionic channel; Voltage-gated channel;
KW Potassium channel; Potassium; Potassium transport; Transmembrane;
KW Phosphorylation; Glycoprotein; Multigene family.


FT DOMAIN      1  189 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  190  210 SEGMENT S1 (POTENTIAL).
FT TRANSMEM  233  253 SEGMENT S2 (POTENTIAL).
FT DOMAIN    254  263 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  264  284 SEGMENT S3 (POTENTIAL).
FT TRANSMEM  299  320 SEGMENT S4 (POTENTIAL).
FT DOMAIN    321  334 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  335  356 SEGMENT S5 (POTENTIAL).
FT DOMAIN    369  389 SEGMENT H5 (PORE-FORMING) (POTENTIAL).
FT TRANSMEM  397  418 SEGMENT S6 (POTENTIAL).
FT DOMAIN    419  911 CYTOPLASMIC (POTENTIAL).
FT CARBOHYD  287  287 N-LINKED (GLCNAC...) (POTENTIAL).


SQ SEQUENCE 911 AA; 102278 MW; 69353D0664C5D689 CRC64;


Query Match 70.7%; Score 29; DB 1; Length 911;

Best Local Similarity 100.0%; Pred. No. 2.3e+02;

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAE 6
       ||||||
Db 352 LVFFAE 357



RESULT 59 
SYI_ARCFU 

ID SYI_ARCFU STANDARD; PRT; 1018 AA. 

AC O29622;

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Isoleucyl-tRNA synthetase (EC 6.1.1.5) (Isoleucine--tRNA ligase)
DE (IleRS).
GN ILES OR AF0633.
OS Archaeoglobus fulgidus.
OC Archaea; Euryarchaeota; Archaeoglobi; Archaeoglobales;
OC Archaeoglobaceae; Archaeoglobus.
OX NCBI_TaxID=2234;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=VC-16 / DSM 4304 / ATCC 49558; 

RX MEDLINE=98049343; PubMed=9389475; 

RA Klenk H.-P., Clayton R.A., Tomb J.-F., White O., Nelson K.E.,
RA Ketchum K.A., Dodson R.J., Gwinn M., Hickey E.K., Peterson J.D.,
RA Richardson D.L., Kerlavage A.R., Graham D.E., Kyrpides N.C.,
RA Fleischmann R.D., Quackenbush J., Lee N.H., Sutton G.G., Gill S.,
RA Kirkness E.F., Dougherty B.A., McKenney K., Adams M.D., Loftus B.,
RA Peterson S., Reich C.I., McNeil L.K., Badger J.H., Glodek A., Zhou L.,
RA Overbeek R., Gocayne J.D., Weidman J.F., McDonald L., Utterback T.,
RA Cotton M.D., Spriggs T., Artiach P., Kaine B.P., Sykes S.M.,
RA Sadow P.W., D'Andrea K.P., Bowman C., Fujii C., Garland S.A.,
RA Mason T.M., Olsen G.J., Fraser C.M., Smith H.O., Woese C.R.,
RA Venter J.C.;



RT "The complete genome sequence of the hyperthermophilic, sulphate-
RT reducing archaeon Archaeoglobus fulgidus.";
RL Nature 390:364-370(1997).
CC -!- CATALYTIC ACTIVITY: ATP + L-isoleucine + tRNA(Ile) = AMP +
CC diphosphate + L-isoleucyl-tRNA(Ile).
CC -!- COFACTOR: Binds 1 zinc ion per subunit (By similarity).
CC -!- SUBCELLULAR LOCATION: Cytoplasmic.
CC -!- SIMILARITY: Belongs to class-I aminoacyl-tRNA synthetase family.
CC --------------------------------------------------------------------------
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC --------------------------------------------------------------------------
DR EMBL; AE001061; AAB90608.1; -.
DR PIR; A69329; A69329.
DR HSSP; P56690; 1ILE.
DR TIGR; AF0633; -.
DR InterPro; IPR002300; tRNA-synt_1a.
DR InterPro; IPR001412; tRNA-synt_I.
DR InterPro; IPR002301; tRNA-synt_ile.
DR InterPro; IPR009008; ValRS_IleRS_edit.
DR Pfam; PF00133; tRNA-synt_1; 1.
DR PRINTS; PR00984; TRNASYNTHILE.
DR TIGRFAMs; TIGR00392; ileS; 1.
DR PROSITE; PS00178; AA_TRNA_LIGASE_I; 1.
KW Aminoacyl-tRNA synthetase; Protein biosynthesis; Ligase; ATP-binding;
KW Metal-binding; Zinc; Complete proteome.
FT SITE       43   53 "HIGH" REGION.
FT SITE      586  590 "KMSKS" REGION.
FT BINDING   589  589 ATP (BY SIMILARITY).
SQ SEQUENCE 1018 AA; 118884 MW; 58C8BBE472FF1D0F CRC64;



Query Match 70.7%; Score 29; DB 1; Length 1018;

Best Local Similarity 83.3%; Pred. No. 2.6e+02;

Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy   3 FFAEDF 8
       || |||
Db 700 FFVEDF 705



RESULT 60 
VNS2_CVBM 

ID VNS2_CVBM STANDARD; PRT; 109 AA. 

AC P15774; 

DT 01-APR-1990 (Rel. 14, Created) 

DT 01-APR-1990 (Rel. 14, Last sequence update)

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Nonstructural protein 2 (Nonstructural 12.7 kDa protein). 

GN NS2.
OS Bovine coronavirus (strain Mebus) (BCoV) (BCV), and
OS Bovine coronavirus (strain F15) (BCoV) (BCV).
OC Viruses; ssRNA positive-strand viruses, no DNA stage; Nidovirales;
OC Coronaviridae; Coronavirus.
OX NCBI_TaxID=11132, 11129;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Mebus;
RX MEDLINE=90320120; PubMed=2142556;
RA Abraham S., Kienzle T.E., Lapps W.E., Brian D.A.;

RT "Sequence and expression analysis of potential nonstructural proteins 

RT of 4.9, 4.8, 12.7, and 9.5 kDa encoded between the spike and membrane 

RT protein genes of the bovine coronavirus."; 

RL Virology 177:488-495(1990). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=F15; 

RX MEDLINE=90206809; PubMed=2320429;

RA Woloszyn N., Boireau P., Laporte J.; 

RT "Nucleotide sequence of the bovine enteric coronavirus BECV F15 mRNA 

RT 5 and mRNA 6 unique regions."; 

RL Nucleic Acids Res. 18:1303-1303(1990). 

CC -!- SIMILARITY: Belongs to the coronavirus NS2 protein family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X51347; CAA35740.1; -. 

DR EMBL; U00735; AAA42913.1; -. 

DR PIR; S08408; MNIHB2.

DR InterPro; IPR006841; Corona_NS2. 

DR Pfam; PF04753; Corona_NS2; 1. 

KW Nonstructural protein. 

SQ SEQUENCE 109 AA; 12806 MW; 08B7CA339A1BD051 CRC64; 

Query Match 68.3%; Score 28; DB 1; Length 109; 

Best Local Similarity 75.0%; Pred. No. 46; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy  1 LVFFAEDF 8
      ||| | ||
Db 90 LVFLAVDF 97



RESULT 61 
VNS2_CVHOC

ID VNS2_CVHOC STANDARD; PRT; 109 AA.

AC Q04853; 

DT 01-FEB-1994 (Rel. 28, Created) 

DT 01-FEB-1994 (Rel. 28, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Nonstructural protein 2 (Nonstructural 12.9 kDa protein). 

GN NS2. 

OS Human coronavirus (strain OC43) (HCoV-OC43).

OC Viruses; ssRNA positive-strand viruses, no DNA stage; Nidovirales; 



OC Coronaviridae; Coronavirus.

OX NCBI_TaxID=31631; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=93297129; PubMed=8517026;

RA Mounir S., Talbot P.J.; 

RT "Human coronavirus OC43 RNA 4 lacks two open reading frames located 

RT downstream of the S gene of bovine coronavirus."; 

RL Virology 192:355-360(1993). 

CC -!- FUNCTION: May act as membrane-anchoring region for structural 

CC proteins during virus assembly, or play a role in membrane 

CC association of the viral polymerase during replication. 

CC -!- SIMILARITY: Belongs to the coronavirus NS2 protein family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; M99576; AAA02569.1; -. 

DR PIR; A44275; A44275. 

DR InterPro; IPR006841; Corona_NS2.
DR Pfam; PF04753; Corona_NS2; 1.

KW Nonstructural protein. 

SQ SEQUENCE 109 AA; 12935 MW; 8F06A6E9E30B1667 CRC64; 

Query Match 68.3%; Score 28; DB 1; Length 109; 
Best Local Similarity 75.0%; Pred. No. 46; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy          1 LVFFAEDF 8
              ||| | ||
Db         90 LVFLAVDF 97



RESULT 62 
RL7A_HALMA

ID RL7A_HALMA STANDARD; PRT; 119 AA. 

AC P12743; 

DT 01-OCT-1989 (Rel. 12, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE 50S ribosomal protein L7Ae (HS6).
GN RPL7AE.
OS Haloarcula marismortui (Halobacterium marismortui).
OC Archaea; Euryarchaeota; Halobacteria; Halobacteriales;

OC Halobacteriaceae; Haloarcula. 

OX NCBI_TaxID=2238; 

RN [1] 

RP SEQUENCE. 

RX MEDLINE=88055606; PubMed=3315748;
RA Kimura J., Arndt E., Kimura M.;

RT "Primary structures of three highly acidic ribosomal proteins S6, S12 
RT and S15 from the archaebacterium Halobacterium marismortui."; 



RL FEBS Lett. 224:65-70(1987).
RN [2]
RP X-RAY CRYSTALLOGRAPHY (2.4 ANGSTROMS), AND REVISIONS TO 108-110.
RC STRAIN=ATCC 43049;
RX MEDLINE=20396344; PubMed=10937989;
RA Ban N., Nissen P., Hansen J., Moore P.B., Steitz T.A.;
RT "The complete atomic structure of the large ribosomal subunit at 2.4
RT A resolution.";
RL Science 289:905-920(2000).
CC -!- SIMILARITY: Belongs to the L7AE family of ribosomal proteins.
CC -!- CAUTION: Was originally (Ref.1) thought to be a protein from the
CC small subunit of the ribosome.
DR PIR; S00182; R5HSS6.
DR PDB; 1FFK; 26-SEP-01.
DR PDB; 1K8A; 19-JUL-02.
DR PDB; 1K9M; 19-JUL-02.
DR PDB; 1KD1; 19-JUL-02.
DR PDB; 1M1K; 23-AUG-02.
DR PDB; 1M90; 06-SEP-02.
DR HAMAP; MF_00326; -; 1.
DR InterPro; IPR004038; Ribosomal_L7A.
DR InterPro; IPR004037; Ribosomal_L7Ae.
DR Pfam; PF01248; Ribosomal_L7Ae; 1.
DR PRINTS; PR00881; L7ARS6FAMILY.
DR PROSITE; PS01082; RIBOSOMAL_L7AE; 1.
KW Ribosomal protein; 3D-structure.
FT INIT_MET 0 0
FT CONFLICT 108 110 MISSING (IN REF. 1).
SQ SEQUENCE 119 AA; 12591 MW; A132AA40519F45C5 CRC64;



Query Match 68.3%; Score 28; DB 1; Length 119; 

Best Local Similarity 85.7%; Pred. No. 50; 

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAED 7
              ||| |||
Db         47 LVFVAED 53



RESULT 63 
Y328_SYNY3 

ID Y328_SYNY3 STANDARD; PRT; 157 AA. 

AC Q55535; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Putative low molecular weight protein-tyrosine-phosphatase slr0328 
DE (EC 3.1.3.48).
GN SLR0328.
OS Synechocystis sp. (strain PCC 6803).
OC Bacteria; Cyanobacteria; Chroococcales; Synechocystis.
OX NCBI_TaxID=1148; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96127529; PubMed=8590279;
RA Kaneko T., Tanaka A., Sato S., Kotani H., Sazuka T., Miyajima N.,
RA Sugiura M., Tabata S.;



RT "Sequence analysis of the genome of the unicellular cyanobacterium 

RT Synechocystis sp. strain PCC6803. I. Sequence features in the 1 Mb 

RT region from map positions 64% to 92% of the genome."; 

RL DNA Res. 2:153-166(1995). 

CC -!- CATALYTIC ACTIVITY: Protein tyrosine phosphate + H(2)O = protein
CC tyrosine + phosphate. 

CC -!- SIMILARITY: Belongs to the low molecular weight phosphotyrosine 
CC protein phosphatase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; D63999; BAA10030.1; -. 

DR PIR; S76052; S76052. 

DR HSSP; P11064; 1DG9. 

DR InterPro; IPR000106; Low_mwt_PTPase.

DR Pfam; PF01451; LMWPc; 1. 

DR PRINTS; PR00719; LMWPTPASE. 

DR SMART; SM00226; LMWPc; 1. 

KW Hypothetical protein; Hydrolase; Complete proteome. 

FT ACT_SITE 7 7 NUCLEOPHILE (BY SIMILARITY).
FT ACT_SITE 12 12 BY SIMILARITY.
FT ACT_SITE 124 124 PROTON DONOR (BY SIMILARITY).
SQ SEQUENCE 157 AA; 17470 MW; D642D8E6D3A49751 CRC64;



Query Match 68.3%; Score 28; DB 1; Length 157; 

Best Local Similarity 83.3%; Pred. No. 66; 

Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          3 FFAEDF 8
              || |||
Db         72 FFPEDF 77



RESULT 64 
DESS_MYXXA 

ID DESS_MYXXA STANDARD; PRT; 173 AA. 

AC P02966; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Development-specific protein S (Spore coat protein S).
GN TPS.
OS Myxococcus xanthus.
OC Bacteria; Proteobacteria; Deltaproteobacteria; Myxococcales;

OC Cystobacterineae; Myxococcaceae; Myxococcus. 

OX NCBI_TaxID=34; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=84070723; PubMed=6316328;
RA Inouye S., Franceschini T., Inouye M.;

RT "Structural similarities between the development-specific protein S 



RT from a Gram-negative bacterium, Myxococcus xanthus, and calmodulin."; 

RL Proc. Natl. Acad. Sci. U.S.A. 80:6829-6833(1983). 

RN [2] 

RP CALCIUM-BINDING SITES, AND MUTAGENESIS.
RX MEDLINE=88087248; PubMed=3121626;
RA Teintze M., Inouye M., Inouye S.;

RT "Characterization of calcium-binding sites in development-specific 

RT protein S of Myxococcus xanthus using site-specific mutagenesis."; 

RL J. Biol. Chem. 263:1199-1203(1988).

RN [3] 

RP STRUCTURE BY NMR. 

RX MEDLINE=94240126; PubMed=8183906; 

RA Bagby S., Harvey T.S., Eagle S.G., Inouye S., Ikura M.;
RT "Structural similarity of a developmentally regulated bacterial spore
RT coat protein to beta gamma-crystallins of the vertebrate eye lens.";

RL Proc. Natl. Acad. Sci. U.S.A. 91:4308-4312(1994). 

RN [4] 

RP STRUCTURE BY NMR. 

RX MEDLINE=94363227; PubMed=8081742;
RA Bagby S., Harvey T.S., Eagle S.G., Inouye S., Ikura M.;

RT "NMR-derived three-dimensional solution structure of protein S 

RT complexed with calcium."; 

RL Structure 2:107-122(1994). 

CC -!- FUNCTION: PROTEIN S, INDUCED IN LARGE AMOUNTS DURING FRUITING BODY
CC FORMATION, ASSEMBLES ON THE SURFACE OF MYXOSPORES IN THE PRESENCE 

CC OF CALCIUM ION. IT BINDS TWO MOLES OF CALCIUM. 

CC -!- DOMAIN: Has a two-domain beta-structure, folded into four very 
CC similar greek key motifs. 

CC -!- SIMILARITY: Belongs to the beta/gamma-crystallin family. 

CC -!- SIMILARITY: Contains 4 beta/gamma crystallin 'Greek key' domains.
CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; J01745; AAA25407.1; -. 

DR PIR; A03490; DZYZSX. 

DR PDB; 1PRR; 31-AUG-94. 

DR PDB; 1PRS; 31-AUG-94. 

DR PDB; 1NPS; 10-JAN-01. 

DR InterPro; IPR001064; Crystallin. 

DR Pfam; PF00030; crystall; 2. 

DR SMART; SM00247; XTALbg; 2. 

DR PROSITE; PS50915; CRYSTALLIN_BETAGAMMA; 4.
KW Calcium-binding; Repeat; Sporulation; 3D-structure.
FT DOMAIN 2 46 BETA/GAMMA CRYSTALLIN 'GREEK KEY' 1.
FT DOMAIN 48 86 BETA/GAMMA CRYSTALLIN 'GREEK KEY' 2.
FT DOMAIN 87 90 CONNECTING PEPTIDE.
FT DOMAIN 91 135 BETA/GAMMA CRYSTALLIN 'GREEK KEY' 3.
FT DOMAIN 136 173 BETA/GAMMA CRYSTALLIN 'GREEK KEY' 4.
FT CA_BIND 32 46 PROBABLE.
FT CA_BIND 121 135 PROBABLE.
FT MUTAGEN 40 40 S->R: SUPPRESSES COMPLETELY CA-BINDING.



FT MUTAGEN [illegible] [illegible] S->R: LOWER CALCIUM AFFINITY.
FT STRAND [illegible] [illegible]
FT TURN 11 [illegible]
FT STRAND 16 19
FT STRAND [illegible] [illegible]
FT HELIX [illegible] [illegible]
FT TURN [illegible] [illegible]
FT STRAND 41 [illegible]
FT STRAND 48 53
FT TURN 56 58
FT STRAND 62 65
FT STRAND 68 69
FT TURN 70 71
FT HELIX 74 77
FT STRAND 81 86
FT STRAND 93 95
FT TURN 99 101
FT STRAND 106 108
FT STRAND 111 113
FT TURN 115 118
FT HELIX 119 122
FT STRAND 131 132
FT STRAND 138 141
FT HELIX 144 146
FT STRAND 150 153
FT STRAND 157 157
FT TURN 161 163
FT STRAND 169 173
SQ SEQUENCE 173 AA; 18793 MW; C504A6409752C454 CRC64;



Query Match 68.3%; Score 28; DB 1; Length 173; 

Best Local Similarity 71.4%; Pred. No. 73; 

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              ||: |||
Db          6 VFYNEDF 12



RESULT 65 
DEST_MYXXA 

ID DEST_MYXXA STANDARD; PRT; 175 AA. 

AC P02967; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Development-specific protein S homolog. 

GN OPS. 

OS Myxococcus xanthus.
OC Bacteria; Proteobacteria; Deltaproteobacteria; Myxococcales;
OC Cystobacterineae; Myxococcaceae; Myxococcus.

OX NCBI_TaxID=34; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=84070723; PubMed=6316328;
RA Inouye S., Franceschini T., Inouye M.;
RT "Structural similarities between the development-specific protein S
RT from a Gram-negative bacterium, Myxococcus xanthus, and calmodulin.";
RL Proc. Natl. Acad. Sci. U.S.A. 80:6829-6833(1983).
CC -!- SUBCELLULAR LOCATION: INSIDE THE SPORES.
CC -!- DOMAIN: Has a two-domain beta-structure, folded into four very
CC similar greek key motifs.
CC -!- SIMILARITY: Belongs to the beta/gamma-crystallin family.
CC -!- SIMILARITY: Contains 4 beta/gamma crystallin 'Greek key' domains.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; J01745; AAA25406.1; -.
DR PIR; A03491; DZYZlX.
DR HSSP; P02966; 1PRS.
DR InterPro; IPR001064; Crystallin.
DR Pfam; PF00030; crystall; 2.
DR SMART; SM00247; XTALbg; 2.
DR PROSITE; PS50915; CRYSTALLIN_BETAGAMMA; 4.
KW Sporulation; Repeat.
FT DOMAIN 2 46 BETA/GAMMA CRYSTALLIN 'GREEK KEY' 1.
FT DOMAIN 48 86 BETA/GAMMA CRYSTALLIN 'GREEK KEY' 2.
FT DOMAIN 87 90 CONNECTING PEPTIDE.
FT DOMAIN 91 135 BETA/GAMMA CRYSTALLIN 'GREEK KEY' 3.
FT DOMAIN 136 175 BETA/GAMMA CRYSTALLIN 'GREEK KEY' 4.
SQ SEQUENCE 175 AA; 19235 MW; 9514FE18D373F9B7 CRC64;







Query Match 68.3%; Score 28; DB 1; Length 175;
Best Local Similarity 71.4%; Pred. No. 73;
Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              ||: |||
Db          6 VFYNEDF 12



RESULT 66 
CME2_BACSU 

ID CME2_BACSU STANDARD; PRT; 189 AA. 

AC P32393; 

DT 01-OCT-1993 (Rel. 27, Created) 

DT 01-OCT-1993 (Rel. 27, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE ComE operon protein 2.

GN COMEB OR COME2 OR BSU25580. 

OS Bacillus subtilis. 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 
OX NCBI_TaxID=1423;
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=95058187; PubMed=7968523;
RA Hahn J., Inamine G., Kozlov Y., Dubnau D.A.;

RT "Characterization of comE, a late competence operon of Bacillus 



RT subtilis required for the binding and uptake of transforming DNA."; 

RL Mol. Microbiol. 10:99-110(1993). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=168 / JH642; 

RX MEDLINE=97124195; PubMed=8969508;
RA Mizuno M., Masuda S., Takemaru K.-I., Hosono S., Sato T., Takeuchi M.,
RA Kobayashi Y.;

RT "Systematic sequencing of the 283 kb 210 degrees-232 degrees region of 

RT the Bacillus subtilis genome containing the skin element and many 

RT sporulation genes."; 

RL Microbiology 142:3103-3111(1996). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=168; 

RX MEDLINE=98044033; PubMed=9384377;
RA Kunst F., Ogasawara N., Moszer I., Albertini A.M., Alloni G.,
RA Azevedo V., Bertero M.G., Bessieres P., Bolotin A., Borchert S.,
RA Borriss R., Boursier L., Brans A., Braun M., Brignell S.C., Bron S.,
RA Brouillet S., Bruschi C.V., Caldwell B., Capuano V., Carter N.M.,
RA Choi S.K., Codani J.J., Connerton I.F., Cummings N.J., Daniel R.A.,
RA Denizot F., Devine K.M., Dusterhoft A., Ehrlich S.D., Emmerson P.T.,
RA Entian K.D., Errington J., Fabret C., Ferrari E., Foulger D.,
RA Fritz C., Fujita M., Fujita Y., Fuma S., Galizzi A., Galleron N.,
RA Ghim S.Y., Glaser P., Goffeau A., Golightly E.J., Grandi G.,
RA Guiseppi G., Guy B.J., Haga K., Haiech J., Harwood C.R., Henaut A.,
RA Hilbert H., Holsappel S., Hosono S., Hullo M.F., Itaya M., Jones L.,
RA Joris B., Karamata D., Kasahara Y., Klaerr-Blanchard M., Klein C.,
RA Kobayashi Y., Koetter P., Koningstein G., Krogh S., Kumano M.,
RA Kurita K., Lapidus A., Lardinois S., Lauber J., Lazarevic V.,
RA Lee S.M., Levine A., Liu H., Masuda S., Mauel C., Medigue C.,
RA Medina N., Mellado R.P., Mizuno M., Moestl D., Nakai S., Noback M.,
RA Noone D., O'Reilly M., Ogawa K., Ogiwara A., Oudega B., Park S.H.,
RA Parro V., Pohl T.M., Portetelle D., Porwollik S., Prescott A.M.,
RA Presecan E., Pujic P., Purnelle B., Rapoport G., Rey M., Reynolds S.,
RA Rieger M., Rivolta C., Rocha E., Roche B., Rose M., Sadaie Y.,
RA Sato T., Scanlan E., Schleich S., Schroeter R., Scoffone F.,
RA Sekiguchi J., Sekowska A., Seror S.J., Serror P., Shin B.S., Soldo B.,
RA Sorokin A., Tacconi E., Takagi T., Takahashi H., Takemaru K.,
RA Takeuchi M., Tamakoshi A., Tanaka T., Terpstra P., Tognoni A.,
RA Tosato V., Uchiyama S., Vandenbol M., Vannier F., Vassarotti A.,
RA Viari A., Wambutt R., Wedler E., Wedler H., Weitzenegger T.,
RA Winters P., Wipat A., Yamamoto H., Yamane K., Yasumoto K., Yata K.,
RA Yoshida K., Yoshikawa H.F., Zumstein E., Yoshikawa H., Danchin A.;

RT "The complete genome sequence of the Gram-positive bacterium Bacillus 

RT subtilis.";

RL Nature 390:249-256(1997). 

CC -!- FUNCTION: DISPENSABLE FOR TRANSFORMABILITY.

CC -!- COFACTOR: Zinc (By similarity). 

CC -!- SIMILARITY: Belongs to the cytidine and deoxycytidylate deaminases 
CC family. 

CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; L15202; AAC36906.1; -. 

DR EMBL; D84432; BAA12453.1; -.

DR EMBL; Z99117; CAB14500.1; -. 

DR PIR; S39864; S39864. 

DR SubtiList; BG10481; comEB. 

DR InterPro; IPR002125; dCMP/cyt_deam. 

DR Pfam; PF00383; dCMP_cyt_deam; 1. 

DR PROSITE; PS00903; CYT_DCMP_DEAMINASES; 1.
KW Hydrolase; Zinc; Complete proteome.
FT METAL 70 70 ZINC (BY SIMILARITY).
FT METAL 98 98 ZINC (BY SIMILARITY).
FT METAL 101 101 ZINC (BY SIMILARITY).

SQ SEQUENCE 189 AA; 20969 MW; B6AC951966EE2D37 CRC64; 



Query Match 68.3%; Score 28; DB 1; Length 189; 

Best Local Similarity 57.1%; Pred. No. 79; 

Matches 4; Conservative 3; Mismatches 0; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              |::|||:
Db        113 VYYAEDY 119



RESULT 67 




BCB1_ARATH
ID BCB1_ARATH STANDARD; PRT; 196 AA.
AC Q07488; O82664;
DT 01-FEB-1995 (Rel. 31, Created)
DT 28-FEB-2003 (Rel. 41, Last sequence update)
DT 15-MAR-2004 (Rel. 43, Last annotation update)
DE Blue copper protein precursor (Blue copper-binding protein) (AtBCB)
DE (Stellacyanin) (Phytocyanin 1).
GN BCB OR AWI 32 OR AT5G20230 OR F5O24.120.
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; rosids;
OC eurosids II; Brassicales; Brassicaceae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=cv. Columbia, and cv. Columbia K85;
RX MEDLINE=94124044; PubMed=8294044;
RA van Gysel A., van Montagu M., Inze D.;
RT "A negatively light-regulated gene from Arabidopsis thaliana encodes
RT a protein showing high similarity to blue copper-binding proteins.";
RL Gene 136:79-85(1993).
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=cv. Wassilewskija;
RA Yang K.Y., Kim C.S., Cho B.H.;
RT "Characterization of a wound-inducible Arabidopsis gene encoding a
RT protein homologous to blue copper binding proteins.";
RL Submitted (OCT-1998) to the EMBL/GenBank/DDBJ databases.
RN [3]





RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Landsberg erecta; 

RX MEDLINE=20233824; PubMed=10769227;
RA Honma T., Goto K.;
RT "The Arabidopsis floral homeotic gene PISTILLATA is regulated by
RT discrete cis-elements responsive to induction and maintenance

RT signals."; 

RL Development 127:2021-2030(2000). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Columbia; 

RX MEDLINE=21016721; PubMed=11130714;
RA Tabata S., Kaneko T., Nakamura Y., Kotani H., Kato T., Asamizu E.,
RA Miyajima N., Sasamoto S., Kimura T., Hosouchi T., Kawashima K.,
RA Kohara M., Matsumoto M., Matsuno A., Muraki A., Nakayama S.,
RA Nakazaki N., Naruo K., Okumura S., Shinpo S., Takeuchi C., Wada T.,
RA Watanabe A., Yamada M., Yasuda M., Sato S., de la Bastide M.,
RA Huang E., Spiegel L., Gnoj L., O'Shaughnessy A., Preston R.,
RA Habermann K., Murray J., Johnson D., Rohlfing T., Nelson J.,
RA Stoneking T., Pepin K., Spieth J., Sekhon M., Armstrong J., Becker M.,
RA Belter E., Cordum H., Cordes M., Courtney L., Courtney W., Dante M.,
RA Du H., Edwards J., Fryman J., Haakensen B., Lamar E., Latreille P.,
RA Leonard S., Meyer R., Mulvaney E., Ozersky P., Riley A., Strowmatt C.,
RA Wagner-McPherson C., Wollam A., Yoakum M., Bell M., Dedhia N.,
RA Parnell L., Shah R., Rodriguez M., Hoon See L., Vil D., Baker J.,
RA Kirchoff K., Toth K., King L., Bahret A., Miller B., Marra M.A.,
RA Martienssen R., McCombie W.R., Wilson R.K., Murphy G., Bancroft I.,
RA Volckaert G., Wambutt R., Duesterhoeft A., Stiekema W., Pohl T.,
RA Entian K.-D., Terryn N., Hartley N., Bent E., Johnson S.,
RA Langham S.-A., McCullagh B., Robben J., Grymonprez B., Zimmermann W.,
RA Ramsperger U., Wedler H., Balke K., Wedler E., Peters S.,
RA van Staveren M., Dirkse W., Mooijman P., Klein Lankhorst R.,
RA Weitzenegger T., Bothe G., Rose M., Hauf J., Berneiser S., Hempel S.,
RA Feldpausch M., Lamberth S., Villarroel R., Gielen J., Ardiles W.,
RA Bents O., Lemcke K., Kolesov G., Mayer K.F.X., Rudd S., Schoof H.,
RA Schueller C., Zaccaria P., Mewes H.-W., Bevan M., Fransz P.F.;

RT "Sequence and analysis of chromosome 5 of the plant Arabidopsis 

RT thaliana."; 

RL Nature 408:823-826(2000). 

RN [5] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Columbia;
RX MEDLINE=22954850; PubMed=14593172;
RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J.,
RA Southwick A.M., Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F.,
RA Karlin-Newmann G., Liu S.X., Lam B., Sakano H., Wu T., Yu G.,
RA Miranda M., Quach H.L., Tripp M., Chang C.H., Lee J.M., Toriumi M.J.,
RA Chan M.M., Tang C.C., Onodera C.S., Deng J.M., Akiyama K., Ansari Y.,
RA Arakawa T., Banh J., Banno F., Bowser L., Brooks S.Y., Carninci P.,
RA Chao Q., Choy N., Enju A., Goldsmith A.D., Gurjal M., Hansen N.F.,
RA Hayashizaki Y., Johnson-Hopson C., Hsuan V.W., Iida K., Karnes M.,
RA Khan S., Koesema E., Ishida J., Jiang P.X., Jones T., Kawai J.,
RA Kamiya A., Meyers C., Nakajima M., Narusaka M., Seki M., Sakurai T.,
RA Satou M., Tamse R., Vaysberg M., Wallender E.K., Wong C., Yamamura Y.,
RA Yuan S., Shinozaki K., Davis R.W., Theologis A., Ecker J.R.;
RT "Empirical analysis of transcriptional activity in the Arabidopsis
RT genome.";



RL Science 302:842-846(2003). 

RN [6] 

RP SEQUENCE FROM N.A. 

RA Brover V., Troukhan M., Alexandrov N., Lu Y.-P., Flavell R.,
RA Feldmann K.A.;
RT "Full-length cDNA from Arabidopsis thaliana.";
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.

RN [7] 

RP GPI-ANCHOR. 

RX MEDLINE=22690167; PubMed=12805588;

RA Borner G.H., Lilley K.S., Stevens T.J., Dupree P.; 

RT "Identification of glycosylphosphatidylinositol-anchored proteins in 

RT Arabidopsis. A proteomic and genomic analysis."; 

RL Plant Physiol. 132:568-577(2003).

CC -!- FUNCTION: Probably acts as an electron carrier. 

CC -!- SUBCELLULAR LOCATION: Attached to the membrane by a GPI-anchor. 

CC -!- DEVELOPMENTAL STAGE: Maximum levels are found in 35 day old 

CC plantlets when the rosette is mature, consisting of 8-10 fully 

CC expanded leaves, and as the floral stem starts to form. This level 

CC remains constant during the further life span of the plant. 

CC -!- INDUCTION: By dark adaptation. This gives a 20-fold increase in

CC expression. 

CC -!- SIMILARITY: Contains 1 plastocyanin-like domain. 

CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; Z15058; CAA78771.1; -. 

DR EMBL; Y18227; CAA77089.1; -. 

DR EMBL; AB035137; BAA86999.1; -. 

DR EMBL; AF296825; -; NOT_ANNOTATED_CDS.

DR EMBL; AY052681; AAK96585.1; -. 

DR EMBL; AY034986; AAK59491.1; -.

DR EMBL; AY142577; AAN13146.1; -. 

DR EMBL; AY088549; AAM66081.1; -. 

DR PIR; I39698; I39698.

DR PIR; T51838; T51838. 

DR HSSP; P29602; 1JER. 

DR InterPro; IPR000923; BlueCu_1.
DR InterPro; IPR008972; Cupredoxin.
DR InterPro; IPR003245; Plcyanin_like.

DR Pfam; PF02298; Cu_bind_like; 1. 

DR ProDom; PD003122; Plcyanin_like; 1. 

DR PROSITE; PS00196; COPPER_BLUE; 1. 

KW Electron transport; Membrane; Metal-binding; Copper; Signal; 

KW Glycoprotein; GPI-anchor; Lipoprotein. 

FT SIGNAL 1 22 POTENTIAL. 

FT CHAIN 23 174 BLUE COPPER PROTEIN. 

FT PROPEP 175 196 REMOVED IN MATURE FORM (PROBABLE).
FT DOMAIN 23 118 PLASTOCYANIN-LIKE.
FT METAL 66 66 COPPER (BY SIMILARITY).
FT METAL 107 107 COPPER (BY SIMILARITY).



FT METAL 112 112 COPPER (BY SIMILARITY).
FT METAL 117 117 COPPER (BY SIMILARITY).
FT [illegible] [illegible] 113 BY SIMILARITY.
FT CARBOHYD 98 98 N-LINKED (GLCNAC...) (POTENTIAL).
FT LIPID 174 174 GPI-anchor amidated asparagine
FT (Probable).
FT CONFLICT 44 44 T -> S (IN REF. 1).
FT CONFLICT 134 134 P -> L (IN REF. 1).
FT CONFLICT 142 142 P -> L (IN REF. 1).
SQ SEQUENCE 196 AA; 20053 MW; 05100B50518F0A56 CRC64;


Query Match 68.3%; Score 28; DB 1; Length 196;
Best Local Similarity 62.5%; Pred. No. 82;
Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
Db         18 VWFAEDY 25



RESULT 68 
UT11_ORYSA

ID UT11_ORYSA STANDARD; PRT; 214 AA.

AC Q8S1Z1; 

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update)

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Probable U3 small nucleolar RNA-associated protein 11 (U3 snoRNA- 

DE associated protein 11).
GN P0468B07.17.
OS Oryza sativa (Rice).

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 

OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; 

OC Ehrhartoideae; Oryzeae; Oryza. 

OX NCBI_TaxID=4530;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Nipponbare; 

RA Sasaki T., Matsumoto T., Yamamoto K.;
RT "Oryza sativa nipponbare (GA3) genomic DNA, chromosome 1, PAC
RT clone:P0468B07.";

RL Submitted (FEB-2001) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: Involved in nucleolar processing of pre-18S ribosomal
CC RNA (By similarity).
CC -!- SUBUNIT: Component of the ribosomal small subunit (SSU)
CC processome (By similarity).
CC -!- SUBCELLULAR LOCATION: Nuclear; nucleolar (By similarity).

CC -!- SIMILARITY: Belongs to the UTP11 family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 



DR EMBL; AP003260; BAB89638.1; -. 

DR Gramene; Q8S1Z1; -.

DR InterPro; IPR007144; Utp11.
DR Pfam; PF03998; Utp11; 1.

KW rRNA processing; Nuclear protein. 

SQ SEQUENCE 214 AA; 25321 MW; 4352134326922B43 CRC64; 

Query Match 68.3%; Score 28; DB 1; Length 214; 

Best Local Similarity 83.3%; Pred. No. 90; 

Matches 5; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy          2 VFFAED 7
              |:||||
Db        132 VYFAED 137



RESULT 69 
Y928_THEMA

ID Y928_THEMA STANDARD; PRT; 246 AA. 

AC P56728; P46803; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Hypothetical protein TM0928. 

GN TM0928.
OS Thermotoga maritima.
OC Bacteria; Thermotogae; Thermotogales; Thermotogaceae; Thermotoga.

OX NCBI_TaxID=2336; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=MSB8 / DSM 3109 / ATCC 43589;
RX MEDLINE=94222023; PubMed=8168477;
RA Darimont B., Sterner R.;

RT "Sequence, assembly and evolution of a primordial ferredoxin from 

RT Thermotoga maritima."; 

RL EMBO J. 13:1772-1781(1994). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=MSB8 / DSM 3109 / ATCC 43589; 

RX MEDLINE=99287316; PubMed=10360571;
RA Nelson K.E., Clayton R.A., Gill S.R., Gwinn M.L., Dodson R.J.,
RA Haft D.H., Hickey E.K., Peterson J.D., Nelson W.C., Ketchum K.A.,
RA McDonald L., Utterback T.R., Malek J.A., Linher K.D., Garrett M.M.,
RA Stewart A.M., Cotton M.D., Pratt M.S., Phillips C.A., Richardson D.,
RA Heidelberg J., Sutton G.G., Fleischmann R.D., Eisen J.A., White O.,
RA Salzberg S.L., Smith H.O., Venter J.C., Fraser C.M.;

RT "Evidence for lateral gene transfer between Archaea and Bacteria from 

RT genome sequence of Thermotoga maritima."; 

RL Nature 399:323-329(1999). 

CC -!- CAUTION: REF.1 SEQUENCE DIFFERS FROM THAT SHOWN DUE TO TWO
CC FRAMESHIFTS, ONE IN POSITIONS 229-246; THE OTHER ONE FUSES 

CC TOGETHER TM0929 AND TM0928. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U24145; AAA65436.1; ALT_FRAME. 

DR EMBL; X82178; CAA57668.1; ALT_FRAME.

DR EMBL; AE001757; AAD36009.1; -.
DR PIR; H72314; H72314.
DR TIGR; TM0928; -.

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 246 AA; 29365 MW; 2A32302C773042C9 CRC64; 

Query Match 68.3%; Score 28; DB 1; Length 246; 

Best Local Similarity 83.3%; Pred. No. 1e+02;
Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          3 FFAEDF 8
              || |||
Db        217 FFEEDF 222



RESULT 70 
Y128_PYRHO

ID Y128_PYRHO STANDARD; PRT; 253 AA.

AC O57868;

DT 15-MAR-2004 (Rel. 43, Created) 

DT 15-MAR-2004 (Rel. 43, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Hypothetical UPF0286 protein PH0128. 

GN PH0128. 

OS Pyrococcus horikoshii. 

OC Archaea; Euryarchaeota; Thermococci; Thermococcales; Thermococcaceae;
OC Pyrococcus.

OX NCBI_TaxID=53953; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=OT3;
RX MEDLINE=98344137; PubMed=9679194;
RA Kawarabayasi Y., Sawada M., Horikawa H., Haikawa Y., Hino Y.,
RA Yamamoto S., Sekine M., Baba S.-I., Kosugi H., Hosoyama A., Nagai Y.,
RA Sakai M., Ogura K., Otsuka R., Nakazawa H., Takamiya M., Ohfuku Y.,
RA Funahashi T., Tanaka T., Kudoh Y., Yamazaki J., Kushida N., Oguchi A.,
RA Aoki K.-I., Yoshizawa T., Nakamura Y., Robb F.T., Horikoshi K.,
RA Masuchi Y., Shizuya H., Kikuchi H.;

RT "Complete sequence and gene organization of the genome of a hyper- 

RT thermophilic archaebacterium, Pyrococcus horikoshii OT3."; 

RL DNA Res. 5:55-76(1998). 

CC -!- SIMILARITY: Belongs to the UPF0286 family. 

CC . 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 



DR EMBL; AP000001; BAA29197.1; -. 

DR PIR; F71233; F71233. 

DR HAMAP; MF_00722; -; 1. 

DR InterPro; IPR002793; DUF91. 

DR Pfam; PF01939; DUF91; 1. 

DR ProDom; PD013521; DUF91; 1. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 253 AA; 29057 MW; FD601BFED45D3DCF CRC64;

Query Match 68.3%; Score 28; DB 1; Length 253;
Best Local Similarity 71.4%; Pred. No. 1.1e+02;
Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              || |||:
Db        112 VFLAEDY 118



RESULT 71 
APAH_ACTAC 

ID APAH_ACTAC STANDARD; PRT; 275 AA.
AC O52655;

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Bis(5'-nucleosyl)-tetraphosphatase, symmetrical (EC 3.6.1.41)
DE (Diadenosine tetraphosphatase) (Ap4A hydrolase) (Diadenosine 5',5'''-
DE P1,P4-tetraphosphate pyrophosphohydrolase).
GN APAH.
OS Actinobacillus actinomycetemcomitans (Haemophilus
OS actinomycetemcomitans).
OC Bacteria; Proteobacteria; Gammaproteobacteria; Pasteurellales;
OC Pasteurellaceae; Actinobacillus.

OX NCBI_TaxID=714; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Y4;
RA Saarela M., Fives-Taylor P.;
RL Submitted (JAN-1998) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Hydrolyzes diadenosine 5',5'''-P1,P4-tetraphosphate to
CC yield ADP (By similarity).
CC -!- CATALYTIC ACTIVITY: P(1),P(4)-bis(5'-adenosyl) tetraphosphate +
CC H(2)O = 2 ADP.

CC -!- SIMILARITY: Belongs to the Ap4A hydrolase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF043998; AAC00202.1; -. 

DR HAMAP; MF_00199; -; 1. 

DR InterPro; IPR004617; ApaH. 

DR InterPro; IPR004843; M-pesterase.
DR InterPro; IPR006186; T_phtase_apaH.

DR Pfam; PF00149; Metallophos; 1. 

DR ProDom; PD000252; T_phtase_apaH; 1. 

DR TIGRFAMs; TIGR00668; apaH; 1. 

KW Hydrolase. 

SQ SEQUENCE 275 AA; 31480 MW; 0843F4FE11FF6DAD CRC64;

Query Match 68.3%; Score 28; DB 1; Length 275;
Best Local Similarity 71.4%; Pred. No. 1.2e+02;
Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              :| ||||
Db         89 IFAAEDF 95



RESULT 72 
Y347_HELPJ 

ID Y347_HELPJ STANDARD; PRT; 293 AA. 

AC Q9ZMA1; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update)
DE Hypothetical pseudouridine synthase JHP0321 (EC 4.2.1.70)
DE (Pseudouridylate synthase) (Uracil hydrolyase).
GN JHP0321.
OS Helicobacter pylori J99 (Campylobacter pylori J99).
OC Bacteria; Proteobacteria; Epsilonproteobacteria; Campylobacterales;
OC Helicobacteraceae; Helicobacter.
OX NCBI_TaxID=85963;

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99120557; PubMed=9923682;
RA Alm R.A., Ling L.-S.L., Moir D.T., King B.L., Brown E.D., Doig P.C.,
RA Smith D.R., Noonan B., Guild B.C., deJonge B.L., Carmel G.,
RA Tummino P.J., Caruso A., Uria-Nickelsen M., Mills D.M., Ives C.,
RA Gibson R., Merberg D., Mills S.D., Jiang Q., Taylor D.E., Vovis G.F.,
RA Trust T.J.;

RT "Genomic sequence comparison of two unrelated isolates of the human 

RT gastric pathogen Helicobacter pylori."; 

RL Nature 397:176-180(1999). 

CC -!- CATALYTIC ACTIVITY: Uracil + D-ribose 5-phosphate = pseudouridine
CC 5'-phosphate + H(2)O.

CC -!- SIMILARITY: Belongs to the pseudouridine synthase rluA family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AE001468; AAD05911.1; -. 

DR PIR; A71946; A71946. 

DR InterPro; IPR006145; PseudoU_synth.
DR InterPro; IPR006224; Rlu_synth.
DR Pfam; PF00849; PseudoU_synth_2; 1.
DR ProDom; PD001819; PSI_RLU; 1.
DR PROSITE; PS01129; PSI_RLU; FALSE_NEG.

KW Hypothetical protein; Lyase; Complete proteome. 

FT ACT_SITE 119 119 BY SIMILARITY. 

SQ SEQUENCE 293 AA; 33668 MW; D7EF3EC54A8E6168 CRC64; 

Query Match 68.3%; Score 28; DB 1; Length 293; 

Best Local Similarity 75.0%; Pred. No. 1.2e+02; 

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              ||| |:||
Db         71 LVFEAKDF 78



RESULT 73 
YC64_AQUAE 

ID YC64_AQUAE STANDARD; PRT; 306 AA. 

AC O67303;
DT 16-OCT-2001 (Rel. 40, Created)
DT 16-OCT-2001 (Rel. 40, Last sequence update)
DT 16-OCT-2001 (Rel. 40, Last annotation update)
DE Hypothetical protein AQ_1264.
GN AQ_1264.
OS Aquifex aeolicus.
OC Bacteria; Aquificae; Aquificales; Aquificaceae; Aquifex.

OX NCBI_TaxID=63363; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=VF5; 

RX MEDLINE=98196666; PubMed=9537320;
RA Deckert G., Warren P.V., Gaasterland T., Young W.G., Lenox A.L.,
RA Graham D.E., Overbeek R., Snead M.A., Keller M., Aujay M., Huber R.,
RA Feldman R.A., Short J.M., Olson G.J., Swanson R.V.;
RT "The complete genome of the hyperthermophilic bacterium Aquifex
RT aeolicus.";

RL Nature 392:353-358(1998). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein (Potential). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AE000731; AAC07265.1; 

DR PIR; D70409; D70409. 

KW Hypothetical protein; Transmembrane; Complete proteome. 



FT TRANSMEM 13 35 POTENTIAL.
FT TRANSMEM 71 93 POTENTIAL.
FT TRANSMEM 98 120 POTENTIAL.
FT TRANSMEM 130 152 POTENTIAL.
FT TRANSMEM 165 184 POTENTIAL.
FT TRANSMEM 218 240 POTENTIAL.
FT TRANSMEM 247 269 POTENTIAL.
FT TRANSMEM 274 296 POTENTIAL.

SQ SEQUENCE 306 AA; 36436 MW; D9BD80ECD8DDC6D1 CRC64;

Query Match 68.3%; Score 28; DB 1; Length 306;
Best Local Similarity 50.0%; Pred. No. 1.3e+02;
Matches 4; Conservative 3; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              |::| |:|
Db        117 LIYFTENF 124



RESULT 74 
ALR2_PSEAE



ID ALR2_PSEAE STANDARD; PRT; 357 AA. 

AC Q9HTQ2; Q9S419;
DT 28-FEB-2003 (Rel. 41, Created)
DT 28-FEB-2003 (Rel. 41, Last sequence update)
DT 28-FEB-2003 (Rel. 41, Last annotation update)
DE Alanine racemase, catabolic (EC 5.1.1.1).
GN DADX OR PA5302.
OS Pseudomonas aeruginosa.
OC Bacteria; Proteobacteria; Gammaproteobacteria; Pseudomonadales;
OC Pseudomonadaceae; Pseudomonas.
OX NCBI_TaxID=287;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ATCC 15692 / PAO1;
RX MEDLINE=20433225; PubMed=10977898;
RA Strych U., Huang H.-C., Krause K.L., Benedik M.J.;
RT "Characterization of the alanine racemases from Pseudomonas aeruginosa
RT PAO1.";

RL Curr. Microbiol. 41:290-294(2000). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ATCC 15692 / PAO1;
RX MEDLINE=20437337; PubMed=10984043;
RA Stover C.K., Pham X.-Q.T., Erwin A.L., Mizoguchi S.D., Warrener P.,
RA Hickey M.J., Brinkman F.S.L., Hufnagle W.O., Kowalik D.J., Lagrou M.,
RA Garber R.L., Goltry L., Tolentino E., Westbrock-Wadman S., Yuan Y.,
RA Brody L.L., Coulter S.N., Folger K.R., Kas A., Larbig K., Lim R.M.,
RA Smith K.A., Spencer D.H., Wong G.K.-S., Wu Z., Paulsen I.T.,
RA Reizer J., Saier M.H., Hancock R.E.W., Lory S., Olson M.V.;
RT "Complete genome sequence of Pseudomonas aeruginosa PAO1, an
RT opportunistic pathogen.";
RL Nature 406:959-964(2000).
CC -!- FUNCTION: Isomerizes L-alanine to D-alanine which is then oxidized
CC to pyruvate by dadA (By similarity).

CC -!- CATALYTIC ACTIVITY: L-alanine = D-alanine. 

CC -!- COFACTOR: Pyridoxal phosphate (By similarity). 

CC -!- PATHWAY: Alanine catabolism; first step. 

CC -!- INDUCTION: By alanine. 

CC -!- SIMILARITY: Belongs to the alanine racemase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AF165881; AAD47081.1; -.
DR EMBL; AE004943; AAG08687.1; -.
DR PIR; F82982; F82982.
DR HSSP; P10724; 1BD0.
DR HAMAP; MF_01201; -; 1.
DR InterPro; IPR000821; Ala_racemase.
DR InterPro; IPR009006; Racem_decarbox_C.
DR InterPro; IPR001608; UPF0001.
DR Pfam; PF00842; Ala_racemase_C; 1.
DR Pfam; PF01168; Ala_racemase_N; 1.
DR PRINTS; PR00992; ALARACEMASE.
DR TIGRFAMs; TIGR00492; alr; 1.
DR PROSITE; PS00395; ALANINE_RACEMASE; 1.
KW Isomerase; Pyridoxal phosphate; Complete proteome.
FT ACT_SITE 33 33 CATALYTIC BASE SPECIFIC TO D-ALANINE (BY
FT SIMILARITY).
FT ACT_SITE 253 253 CATALYTIC BASE SPECIFIC TO L-ALANINE (BY
FT SIMILARITY).
FT BINDING 33 33 PYRIDOXAL PHOSPHATE (BY SIMILARITY).
FT CONFLICT 138 138 S -> R (IN REF. 1).
FT CONFLICT 178 178 A -> S (IN REF. 1).
FT CONFLICT 262 262 S -> R (IN REF. 1).
SQ SEQUENCE 357 AA; 38914 MW; 199F4021D02FFF48 CRC64;


Query Match 68.3%; Score 28; DB 1; Length 357;
Best Local Similarity 83.3%; Pred. No. 1.5e+02;
Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          3 FFAEDF 8
              || |||
Db        132 FFPEDF 137



RESULT 75 


YJGP_HAEIN
ID YJGP_HAEIN STANDARD; PRT; 372 AA.
AC P45333;
DT 01-NOV-1995 (Rel. 32, Created)
DT 01-NOV-1995 (Rel. 32, Last sequence update)
DT 28-FEB-2003 (Rel. 41, Last annotation update)
DE Hypothetical protein HI1704.
GN HI1704.
OS Haemophilus influenzae.
OC Bacteria; Proteobacteria; Gammaproteobacteria; Pasteurellales;
OC Pasteurellaceae; Haemophilus.
OX NCBI_TaxID=727;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=Rd / KW20 / ATCC 51907;
RX MEDLINE=95350630; PubMed=7542800;
RA Fleischmann R.D., Adams M.D., White O., Clayton R.A., Kirkness E.F.,
RA Kerlavage A.R., Bult C.J., Tomb J.-F., Dougherty B.A., Merrick J.M.,
RA McKenney K., Sutton G., Fitzhugh W., Fields C.A., Gocayne J.D.,
RA Scott J.D., Shirley R., Liu L.-I., Glodek A., Kelley J.M.,
RA Weidman J.F., Phillips C.A., Spriggs T., Hedblom E., Cotton M.D.,
RA Utterback T.R., Hanna M.C., Nguyen D.T., Saudek D.M., Brandon R.C.,
RA Fine L.D., Fritchman J.L., Fuhrmann J.L., Geoghagen N.S.M.,
RA Gnehm C.L., McDonald L.A., Small K.V., Fraser C.M., Smith H.O.,
RA Venter J.C.;
RT "Whole-genome random sequencing and assembly of Haemophilus influenzae
RT Rd.";
RL Science 269:496-512(1995).
CC -!- SUBCELLULAR LOCATION: Integral membrane protein (Potential).
CC -!- SIMILARITY: STRONG, TO E.COLI YJGP.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; U32843; AAC23350.1; -.
DR PIR; A64176; A64176.
DR TIGR; HI1704; -.
DR InterPro; IPR005495; YjgP_YjgQ.
DR Pfam; PF03739; YjgP_YjgQ; 1.
KW Hypothetical protein;
FT TRANSMEM 16 36 POTENTIAL.
FT TRANSMEM 54 74 POTENTIAL.
FT TRANSMEM 101 121 POTENTIAL.
FT TRANSMEM 266 286 POTENTIAL.
FT TRANSMEM 296 316 POTENTIAL.
FT TRANSMEM 325 345 POTENTIAL.
SQ SEQUENCE 372 AA; 41336 MW; 3BEFE2417366FB38 CRC64;


Query Match 68.3%; Score 28; DB 1; Length 372;
Best Local Similarity 50.0%; Pred. No. 1.6e+02;
Matches 4; Conservative 3; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              |:||:: |
Db         24 LIFFSQQF 31



RESULT 76 
RAL1_ONCVO
ID RAL1_ONCVO STANDARD; PRT; 388 AA.
AC P11012;
DT 01-JUL-1989 (Rel. 11, Created)
DT 01-NOV-1995 (Rel. 32, Last sequence update)
DT 01-OCT-1996 (Rel. 34, Last annotation update)
DE RAL-1 protein precursor (41 kDa larval antigen).
GN RAL1.
OS Onchocerca volvulus.

OC Eukaryota; Metazoa; Nematoda; Chromadorea; Spirurida; Filarioidea; 
OC Onchocercidae; Onchocerca. 



OX NCBI_TaxID=6282; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=94341871; PubMed=7520419;
RA Rokeach L.A., Zimmerman P.A., Unnasch T.R.;
RT "Epitopes of the Onchocerca volvulus RAL1 antigen, a member of the
RT calreticulin family of proteins, recognized by sera from patients
RT with onchocerciasis.";
RL Infect. Immun. 62:3696-3704(1994).
RN [2]
RP SEQUENCE OF 53-388 FROM N.A.
RX MEDLINE=88273584; PubMed=2455736;
RA Unnasch T.R., Gallin M.Y., Soboslay P.T., Erttmann K.D., Greene B.M.;
RT "Isolation and characterization of expression cDNA clones encoding
RT antigens of Onchocerca volvulus infective larvae.";
RL J. Clin. Invest. 82:262-269(1988).

CC -!- SIMILARITY: Belongs to the calreticulin family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; M20565; AAA59056.1; -. 

DR PIR; A32507; A32507. 



DR InterPro; IPR009033; Calret_calnex_P.
DR InterPro; IPR001580; Calreticulin.
DR InterPro; IPR008985; ConA_like_lec_gl.
DR Pfam; PF00262; calreticulin; 1.
DR PIRSF; PIRSF002356; Calreticulin; 1.
DR PRINTS; PR00626; CALRETICULIN.
DR ProDom; PD001866; Calreticulin; 1.
DR PROSITE; PS00803; CALRETICULIN_1; 1.
DR PROSITE; PS00804; CALRETICULIN_2; 1.
DR PROSITE; PS00805; CALRETICULIN_REPEAT; 3.
KW Calcium-binding; Repeat; Antigen; Signal.
FT SIGNAL 1 17 POTENTIAL.
FT CHAIN 18 388 RAL-1 PROTEIN.
FT DOMAIN 189 253 4 X APPROXIMATE REPEATS.
FT REPEAT 189 200 1-1.
FT REPEAT 208 219 1-2.
FT REPEAT 225 236 1-3.
FT REPEAT 242 253 1-4.
FT DOMAIN 257 295 3 X APPROXIMATE REPEATS.
FT REPEAT 257 267 2-1.
FT REPEAT 271 281 2-2.
FT REPEAT 285 295 2-3.
FT DOMAIN 353 388 ARG/LYS-RICH (BASIC).
FT DISULFID 135 161 BY SIMILARITY.
SQ SEQUENCE 388 AA; 45298 MW; 9537F298A2D31CD6 CRC64;



Query Match 68.3%; Score 28; DB 1; Length 388; 

Best Local Similarity 57.1%; Pred. No. 1.6e+02; 

Matches 4; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              ::| |||
Db         19 IYFKEDF 25



RESULT 77 
PAN3_HUMAN 

ID PAN3_HUMAN STANDARD; PRT; 392 AA. 

AC Q96QZ0; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Pannexin 3.
GN PANX3.
OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE FROM N.A. 

RA Baranova A., Ivanov D., Skoblov M., Pestova A., Kelmanson I.,
RA Shagin D., Usman N., Lukyanov S., Panchin Y.;
RT "Mammalian pannexin family homologous to invertebrate gap-junction
RT proteins are differentially expressed in nervous tissue.";
RL Submitted (AUG-2001) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: STRUCTURAL COMPONENT OF THE GAP JUNCTIONS. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein (Potential). 

CC -!- SIMILARITY: Belongs to the innexin family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF406650; AAK95655.1; -. 

DR Genew; HGNC:20573; PANX3.

KW Gap junction; Transmembrane. 

FT TRANSMEM 37 57 POTENTIAL. 

FT TRANSMEM 110 132 POTENTIAL. 

FT TRANSMEM 268 290 POTENTIAL. 

SQ SEQUENCE 392 AA; 44683 MW; 8988F5DCC6E1B050 CRC64;

Query Match 68.3%; Score 28; DB 1; Length 392;
Best Local Similarity 71.4%; Pred. No. 1.6e+02;
Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              ||| |:|
Db        234 VFFQEEF 240



RESULT 78
ASSY_AQUAE
ID ASSY_AQUAE STANDARD; PRT; 401 AA.
AC O67213;

DT 15-JUL-1999 (Rel. 38, Created) 

DT 15-JUL-1999 (Rel. 38, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Argininosuccinate synthase (EC 6.3.4.5) (Citrulline--aspartate
DE ligase).

GN ARGG OR AQ_1140. 

OS Aquifex aeolicus. 

OC Bacteria; Aquificae; Aquificales; Aquificaceae; Aquifex. 

OX NCBI_TaxID=63363; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=VF5;
RX MEDLINE=98196666; PubMed=9537320;
RA Deckert G., Warren P.V., Gaasterland T., Young W.G., Lenox A.L.,
RA Graham D.E., Overbeek R., Snead M.A., Keller M., Aujay M., Huber R.,
RA Feldman R.A., Short J.M., Olson G.J., Swanson R.V.;
RT "The complete genome of the hyperthermophilic bacterium Aquifex
RT aeolicus.";
RL Nature 392:353-358(1998).
CC -!- CATALYTIC ACTIVITY: ATP + L-citrulline + L-aspartate = AMP +
CC diphosphate + L-argininosuccinate.

CC -!- PATHWAY: Arginine biosynthesis; seventh step. 

CC -!- SUBUNIT: Homotetramer (By similarity). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic (Probable). 

CC -!- SIMILARITY: Belongs to the argininosuccinate synthase family. 
CC Subfamily 1. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AE000725; AAC07170.1; -. 

DR PIR; B70398; B70398. 

DR HAMAP; MF_00005; -; 1. 

DR InterPro; IPR001518; Arginosuc_synth.
DR Pfam; PF00764; Arginosuc_synth; 1.
DR ProDom; PD003544; Arginosuc_synth; 1.
DR TIGRFAMs; TIGR00032; argG; 1.
DR PROSITE; PS00564; ARGININOSUCCIN_SYN_1; 1.
DR PROSITE; PS00565; ARGININOSUCCIN_SYN_2; 1.

KW Arginine biosynthesis; Ligase; ATP-binding; Complete proteome. 

SQ SEQUENCE 401 AA; 45848 MW; 2BCBA9D027CE33CF CRC64; 



Query Match 68.3%; Score 28; DB 1; Length 401; 

Best Local Similarity 62.5%; Pred. No. 1.7e+02; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0; 



Qy          1 LVFFAEDF 8
              ||::|| |
Db        101 LVYYAEKF 108



RESULT 79
NQRF_HAEIN
ID NQRF_HAEIN STANDARD; PRT; 411 AA.
AC O05012;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Na(+)-translocating NADH-quinone reductase subunit F (EC 1.6.5.-)
DE (Na(+)-translocating NQR subunit F) (Na(+)-NQR subunit F) (NQR complex
DE subunit F) (NQR-1 subunit F).
GN NQRF OR HI0171.
OS Haemophilus influenzae.
OC Bacteria; Proteobacteria; Gammaproteobacteria; Pasteurellales;

OC Pasteurellaceae; Haemophilus. 

OX NCBI_TaxID=727; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Rd / KW20 / ATCC 51907; 

RX MEDLINE=95350630; PubMed=7542800;
RA Fleischmann R.D., Adams M.D., White O., Clayton R.A., Kirkness E.F.,
RA Kerlavage A.R., Bult C.J., Tomb J.-F., Dougherty B.A., Merrick J.M.,
RA McKenney K., Sutton G., Fitzhugh W., Fields C.A., Gocayne J.D.,
RA Scott J.D., Shirley R., Liu L.-I., Glodek A., Kelley J.M.,
RA Weidman J.F., Phillips C.A., Spriggs T., Hedblom E., Cotton M.D.,
RA Utterback T.R., Hanna M.C., Nguyen D.T., Saudek D.M., Brandon R.C.,
RA Fine L.D., Fritchman J.L., Fuhrmann J.L., Geoghagen N.S.M.,
RA Gnehm C.L., McDonald L.A., Small K.V., Fraser C.M., Smith H.O.,
RA Venter J.C.;
RT "Whole-genome random sequencing and assembly of Haemophilus influenzae
RT Rd.";
RL Science 269:496-512(1995).
RN [2]
RP IDENTIFICATION AS NQR SYSTEM.
RC STRAIN=Rd / KW20 / ATCC 51907;
RX MEDLINE=96176316; PubMed=8601449;
RA Hayashi M., Nakayama Y., Unemoto T.;

RT "Existence of Na+-translocating NADH-quinone reductase in Haemophilus 

RT influenzae."; 

RL FEBS Lett. 381:174-176(1996). 

CC -!- FUNCTION: NQR complex catalyzes the reduction of ubiquinone-1 to
CC ubiquinol by two successive reactions, coupled with the transport
CC of Na(+) ions from the cytoplasm to the periplasm. The first step
CC is catalyzed by nqrF, which accepts electrons from NADH and
CC reduces ubiquinone-1 to ubisemiquinone by a one-electron transfer
CC pathway.
CC -!- CATALYTIC ACTIVITY: NADH + ubiquinone + Na(+)(In) = NAD(+) +
CC ubiquinol + Na(+)(Out).
CC -!- COFACTOR: FAD and a 2Fe-2S cluster (By similarity).
CC -!- SUBUNIT: Composed of six subunits; nqrA, nqrB, nqrC, nqrD, nqrE
CC and nqrF (By similarity).

CC -!- SUBCELLULAR LOCATION: Inner membrane (Potential). 

CC -!- SIMILARITY: Belongs to the nqrF family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U32702; AAC21841.1; -. 

DR PIR; D64052; D64052. 

DR HSSP; P23486; 1QFJ. 

DR TIGR; HI0171; 

DR HAMAP; MF_00430; -; 1. 

DR InterPro; IPR006058; 2Fe2S_fd_BS. 

DR InterPro; IPR008333; FAD_binding_6.
DR InterPro; IPR001041; Ferredoxin.
DR InterPro; IPR001709; FPN_cyt_redctse.
DR InterPro; IPR001433; Oxred_FAD/NAD(P).
DR Pfam; PF00970; FAD_binding_6; 1.
DR Pfam; PF00111; fer2; 1.
DR Pfam; PF00175; NAD_binding_1; 1.
DR PRINTS; PR00371; FPNCR.
DR PROSITE; PS00197; 2FE2S_FERREDOXIN; FALSE_NEG.

KW Oxidoreductase; NAD; Ubiquinone; Transport; Sodium transport; 

KW Flavoprotein; FAD; Metal-binding; Iron-sulfur; Iron; 2Fe-2S; 

KW Transmembrane; Inner membrane; Complete proteome. 



FT TRANSMEM 5 27 POTENTIAL.
FT DOMAIN 38 120 FERREDOXIN.
FT DOMAIN 276 393 CATALYTIC.
FT METAL 73 73 IRON-SULFUR (2FE-2S) (BY SIMILARITY).
FT METAL 79 79 IRON-SULFUR (2FE-2S) (BY SIMILARITY).
FT METAL 82 82 IRON-SULFUR (2FE-2S) (BY SIMILARITY).
FT METAL 114 114 IRON-SULFUR (2FE-2S) (BY SIMILARITY).
SQ SEQUENCE 411 AA; 45705 MW; ED560053E66D23E0 CRC64;

Query Match 68.3%; Score 28; DB 1; Length 411;
Best Local Similarity 57.1%; Pred. No. 1.7e+02;
Matches 4; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              :|: |||
Db        320 IFYQEDF 326



RESULT 80 
CRT2_BOVIN 

ID CRT2_BOVIN STANDARD; PRT; 421 AA. 

AC P42918; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 01-NOV-1995 (Rel. 32, Last sequence update) 

DT 15-JUL-1998 (Rel. 36, Last annotation update) 

DE Calreticulin, brain isoform 2 precursor (CRP55) (Calregulin) (HACBP).
OS Bos taurus (Bovine).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 



RX MEDLINE=93385184; PubMed=8373827;
RA Liu N., Fine R.E., Johnson R.J.;
RT "Comparison of cDNAs from bovine brain coding for two isoforms of
RT calreticulin.";
RL Biochim. Biophys. Acta 1202:70-76(1993).

CC -!- FUNCTION: This protein binds calcium. There are both high and low 
CC affinity calcium-binding sites. 

CC -!- SUBUNIT: Monomer (By similarity). 

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum lumen. 

CC -!- SIMILARITY: Belongs to the calreticulin family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; L13462; AAC37307.1; -. 

DR PIR; S36799; S36799. 



DR InterPro; IPR009033; Calret_calnex_P.
DR InterPro; IPR001580; Calreticulin.
DR InterPro; IPR008985; ConA_like_lec_gl.
DR InterPro; IPR000886; ER_target_S.
DR Pfam; PF00262; calreticulin; 1.
DR PIRSF; PIRSF002356; Calreticulin; 1.
DR PRINTS; PR00626; CALRETICULIN.
DR ProDom; PD001866; Calreticulin; 1.
DR PROSITE; PS00014; ER_TARGET; 1.
DR PROSITE; PS00803; CALRETICULIN_1; 1.
DR PROSITE; PS00804; CALRETICULIN_2; 1.
DR PROSITE; PS00805; CALRETICULIN_REPEAT; 3.
KW Endoplasmic reticulum; Calcium-binding; Repeat; Signal.
FT SIGNAL 1 34 POTENTIAL.
FT CHAIN 35 421 CALRETICULIN, BRAIN ISOFORM 2.
FT DOMAIN 35 201 N-DOMAIN.
FT DOMAIN 202 312 P-DOMAIN.
FT DOMAIN 313 421 C-DOMAIN.
FT DOMAIN 195 259 4 X APPROXIMATE REPEATS.
FT REPEAT 195 206 1-1.
FT REPEAT 214 225 1-2.
FT REPEAT 231 242 1-3.
FT REPEAT 248 259 1-4.
FT DOMAIN 263 301 3 X APPROXIMATE REPEATS.
FT REPEAT 263 273 2-1.
FT REPEAT 277 287 2-2.
FT REPEAT 291 301 2-3.
FT DOMAIN 366 411 ASP/GLU/LYS-RICH.
FT DISULFID 141 167 BY SIMILARITY.
FT CARBOHYD 183 183 N-LINKED (GLCNAC...) (POTENTIAL).
FT SITE 418 421 PREVENT SECRETION FROM ER.
SQ SEQUENCE 421 AA; 48812 MW; 0257E959F71528BC CRC64;



Query Match 68.3%; Score 28; DB 1; Length 421; 

Best Local Similarity 71.4%; Pred. No. 1.8e+02; 

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              |||:| |
Db         38 VFFSEQF 44



RESULT 81 
E1BL_ADEM1 

ID E1BL_ADEM1 STANDARD; PRT; 433 AA. 

AC P12536; 

DT 01-OCT-1989 (Rel. 12, Created) 

DT 01-OCT-1989 (Rel. 12, Last sequence update) 

DT 01-FEB-1991 (Rel. 17, Last annotation update) 

DE E1B protein, large T-antigen (55 kDa protein) . 

OS Mouse adenovirus type 1 (MAV-1) . 

OC Viruses; dsDNA viruses, no RNA stage; Adenoviridae; Mastadenovirus.

OX NCBI_TaxID=10530; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=89012162; PubMed=3172335;
RA Ball A.O., Williams M.E., Spindler K.R.;
RT "Identification of mouse adenovirus type 1 early region 1: DNA
RT sequence and a conserved transactivating function.";
RL J. Virol. 62:3947-3957(1988).

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

DR EMBL; M22245; AAA42425.1; -. 

DR InterPro; IPR002612; AdenoE1B_55kDa.
DR Pfam; PF01696; Adeno_E1B_55K; 1.

KW Early protein. 

SQ SEQUENCE 433 AA; 49056 MW; B67197DB24AD2D95 CRC64; 

Query Match 68.3%; Score 28; DB 1; Length 433; 

Best Local Similarity 85.7%; Pred. No. 1.8e+02; 

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy          2 VFFAEDF 8
              |||| ||
Db        198 VFFACDF 204



RESULT 82 
NTPB_ENTHR 

ID NTPB_ENTHR STANDARD; PRT; 458 AA. 

AC Q08637; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 01-NOV-1995 (Rel. 32, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE V-type sodium ATP synthase subunit B (EC 3.6.3.15) (Na(+)-
DE translocating ATPase subunit B).



GN NTPB. 

OS Enterococcus hirae. 

OC Bacteria; Firmicutes; Lactobacillales; Enterococcaceae; Enterococcus.
OX NCBI_TaxID=1354;

RN [1] 

RP SEQUENCE FROM N.A., AND PARTIAL SEQUENCE OF 1-25.
RC STRAIN=ATCC 9790;
RX MEDLINE=93280117; PubMed=8505293;
RA Takase K., Yamato I., Kakinuma Y.;
RT "Cloning and sequencing of the genes coding for the A and B subunits
RT of vacuolar-type Na(+)-ATPase from Enterococcus hirae. Coexistence of
RT vacuolar- and F0F1-type ATPases in one bacterial cell.";
RL J. Biol. Chem. 268:11610-11616(1993).
CC -!- FUNCTION: Involved in ATP-driven sodium extrusion.
CC -!- CATALYTIC ACTIVITY: ATP + H(2)O = ADP + phosphate.
CC -!- INDUCTION: By increasing intracellular Na(+) concentration.

CC -!- SIMILARITY: Belongs to the ATPase alpha/beta chains family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; D13816; BAA02970.1; 

DR EMBL; D17462; BAA04276.1; 

DR PIR; B46733; B46733. 

DR HAMAP; MF_00310; -; 1. 

DR InterPro; IPR000793; ATPase_a/b_C.

DR InterPro; IPR000194; ATPase_a/bcentre.

DR InterPro; IPR004100; ATPase_a/bN.

DR InterPro; IPR009005; F1_ATPase_a/bN.

DR Pfam; PF00006; ATP-synt_ab; 1.

DR Pfam; PF00306; ATP-synt_ab_C; 1.

DR Pfam; PF02874; ATP-synt_ab_N; 1.

DR PROSITE; PS00152; ATPASE_ALPHA_BETA; 1.

KW Hydrolase; ATP synthesis; Sodium transport. 

SQ SEQUENCE 458 AA; 51141 MW; D89D8F2F58B65262 CRC64;



Query Match 68.3%; Score 28; DB 1; Length 458; 

Best Local Similarity 83.3%; Pred. No. 1.9e+02; 

Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy          3 FFAEDF 8
              || |||
Db        193 FFMEDF 198



RESULT 83 
CRTD_RHOSH 

ID CRTD_RHOSH STANDARD; PRT; 495 AA.

AC Q01671; 

DT 01-JUL-1993 (Rel. 26, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 



DE Methoxyneurosporene dehydrogenase (EC 1.14.99.-). 

GN CRTD.

OS Rhodobacter sphaeroides (Rhodopseudomonas sphaeroides).

OC Bacteria; Proteobacteria; Alphaproteobacteria; Rhodobacterales;

OC Rhodobacteraceae; Rhodobacter. 

OX NCBI_TaxID=1063; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ATCC 17023 / 2.4.1 / NCIB 8253 / DSM 158; 

RX MEDLINE=92307398; PubMed=1612412;

RA Gari E., Toledo J.C., Gibert I., Barbe J.; 

RT "Nucleotide sequence of the methoxyneurosporene dehydrogenase gene 

RT from Rhodobacter sphaeroides: comparison with other bacterial 

RT carotenoid dehydrogenases."; 

RL FEMS Microbiol. Lett. 72:103-108(1992). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ATCC 17023 / 2.4.1 / NCIB 8253 / DSM 158;

RX MEDLINE=95238278; PubMed=7721699;

RA Lang H.P., Cogdell R.J., Takaichi S., Hunter C.N.; 

RT "Complete DNA sequence, specific Tn5 insertion map, and gene 

RT assignment of the carotenoid biosynthesis pathway of Rhodobacter 

RT sphaeroides.";

RL J. Bacteriol. 177:2064-2073(1995). 

CC -!- FUNCTION: CONVERTS HYDROXYNEUROSPORENE TO DEMETHYLSPHEROIDENE OR 
CC METHOXYNEUROSPORENE TO SPHEROIDENE.

CC -!- COFACTOR: FAD (Probable). 

CC -!- PATHWAY: Carotenoid and chlorophyll biosynthesis. 

CC -!- SIMILARITY: Belongs to the phytoene dehydrogenase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X63204; CAA44886.1; -. 

DR EMBL; AJ010302; CAB38743.1; ALT_SEQ. 

DR PIR; S23633; S23633. 

DR InterPro; IPR002937; Amino_oxidase.

DR InterPro; IPR008150; Bac_phytoene_dh.

DR InterPro; IPR000205; NAD_BS.

DR Pfam; PF01593; Amino_oxidase; 1. 

DR PROSITE; PS00982; PHYTOENE_DH; 1. 

KW Photosynthesis; Chlorophyll biosynthesis; Carotenoid biosynthesis; 

KW Oxidoreductase; FAD; Flavoprotein; NAD. 



FT NP_BIND 9 42 FAD (ADP PART) (POTENTIAL).

FT CONFLICT 115 115 A -> G (IN REF. 2).

FT CONFLICT 157 157 T -> P (IN REF. 2).

FT CONFLICT 170 170 L -> M (IN REF. 2).

FT CONFLICT 273 273 L -> I (IN REF. 2).

FT CONFLICT 440 443 PHGA -> ATGP (IN REF. 1).

SQ SEQUENCE 495 AA; 52900 MW; 62227931415B253E CRC64;



Query Match 68.3%; Score 28; DB 1; Length 495;



Best Local Similarity 83.3%; Pred. No. 2.1e+02; 

Matches 5; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy          2 VFFAED 7
              ||||:|
Db        331 VFFADD 336



RESULT 84 
SYK_METMP 

ID SYK_METMP STANDARD; PRT; 533 AA. 

AC O30522;

DT 15-JUL-1998 (Rel. 36, Created)

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Lysyl-tRNA synthetase (EC 6.1.1.6) (Lysine--tRNA ligase) (LysRS).

GN LYSS.

OS Methanococcus maripaludis.

OC Archaea; Euryarchaeota; Methanococci; Methanococcales;

OC Methanococcaceae; Methanococcus.

OX NCBI_TaxID=39152; 

RN [1] 

RP SEQUENCE FROM N.A., AND SEQUENCE OF 1-22.

RC STRAIN=JJ;

RX MEDLINE=98016282; PubMed=9353192;

RA Ibba M., Morgan S., Curnow A.W., Pridmore D.R., Vothknecht U.C.,

RA Gardner W., Lin W., Woese C.R., Soell D.; 

RT "A euryarchaeal lysyl-tRNA synthetase: resemblance to class I 

RT synthetases."; 

RL Science 278:1119-1122(1997). 

RN [2] 

RP CHARACTERIZATION. 

RX MEDLINE=20570460; PubMed=11121028;

RA Soell D., Becker H.D., Plateau P., Blanquet S., Ibba M.;

RT "Context-dependent anticodon recognition by class I lysyl-tRNA 

RT synthetases."; 

RL Proc. Natl. Acad. Sci. U.S.A. 97:14224-14228(2000). 

CC -!- CATALYTIC ACTIVITY: ATP + L-lysine + tRNA(Lys) = AMP + diphosphate 

CC + L-lysyl-tRNA(Lys).

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- MISCELLANEOUS: Able to charge E.coli tRNA(Lys) in vitro. 

CC -!- SIMILARITY: Belongs to class-I aminoacyl-tRNA synthetase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF009824; AAB87410.1;

DR PIR; T46975; T46975. 

DR HAMAP; MF_00177; -; 1. 

DR InterPro; IPR002904; Lys_tRNA-synt_1c.

DR InterPro; IPR008925; tRNA-synt_bind. 

DR InterPro; IPR001412; tRNA-synt I. 



DR Pfam; PF01921; tRNA-synt_1f; 1.

DR TIGRFAMs; TIGR00467; lysS_arch; 1.

DR PROSITE; PS00178; AA_TRNA_LIGASE_I; 1.

KW Aminoacyl-tRNA synthetase; Protein biosynthesis; Ligase; ATP-binding 

FT SITE 28 36 "HIGH" REGION. 

FT SITE 278 282 "KMSKS" REGION. 

FT CONFLICT 11 11 I -> L (IN REF. 1; AA SEQUENCE).

SQ SEQUENCE 533 AA; 61273 MW; 007FA868A57A0AC2 CRC64;



Query Match 68.3%; Score 28; DB 1; Length 533; 

Best Local Similarity 50.0%; Pred. No. 2.2e+02; 

Matches 4; Conservative 3; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              |:| |:|:
Db         59 LIFIADDY 66



RESULT 85 
CH62_CHLCV 

ID CH62_CHLCV STANDARD; PRT; 536 AA. 

AC P59698; 

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE 60 kDa chaperonin 2 (Protein Cpn60 2) (groEL protein 2).

GN GROL2 OR GROEL2 OR GROEL-2 OR CCA00980.

OS Chlamydophila caviae.

OC Bacteria; Chlamydiae; Chlamydiales; Chlamydiaceae; Chlamydophila.

OX NCBI_TaxID=83557;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=GPIC; 

RX MEDLINE=22569155; PubMed=12682364;

RA Read T.D., Myers G.S.A., Brunham R.C., Nelson W.C., Paulsen I.T.,

RA Heidelberg J., Holtzapple E., Khouri H., Federova N.B., Carty H.A.,

RA Umayam L.A., Haft D.H., Peterson J., Beanan M.J., White O.,

RA Salzberg S.L., Hsia R.-C., McClarty G., Rank R.G., Bavoil P.M.,

RA Fraser C.M.;

RT "Genome sequence of Chlamydophila caviae (Chlamydia psittaci GPIC):

RT examining the role of niche-specific genes in the evolution of the 

RT Chlamydiaceae."; 

RL Nucleic Acids Res. 31:2134-2147(2003). 

CC -!- FUNCTION: Prevents misfolding and promotes the refolding and 
CC proper assembly of unfolded polypeptides generated under stress 

CC conditions (By similarity).

CC -!- SUBUNIT: Oligomer of 14 subunits composed of two stacked rings of
CC 7 subunits (By similarity).

CC -!- SUBCELLULAR LOCATION: Cytoplasmic (By similarity). 

CC -!- SIMILARITY: Belongs to the chaperonin (HSP60) family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 



CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AE016997; AAP05719.1; -. 

DR TIGR; CCA00980; 

DR HAMAP; MF_00600; -; 1. 

DR InterPro; IPR001844; Chaprnin_Cpn60.

DR InterPro; IPR002423; Cpn60/TCP-1.

DR InterPro; IPR008950; GroEL-ATPase.

DR PRINTS; PR00298; CHAPERONIN60.

DR PRINTS; PR00304; TCOMPLEXTCP1.

DR PROSITE; PS00296; CHAPERONINS_CPN60; FALSE_NEG.

KW Chaperone; ATP-binding; Complete proteome.

SQ SEQUENCE 536 AA; 57848 MW; 053B62380116A1DD CRC64;



Query Match 68.3%; Score 28; DB 1; Length 536;

Best Local Similarity 62.5%; Pred. No. 2.2e+02; 

Matches 5; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 



Qy          1 LVFFAEDF 8
              |:  ||||
Db        248 LIIIAEDF 255



RESULT 86 
COX1_RHILE

ID COX1_RHILE STANDARD; PRT; 538 AA.

AC Q08855; 

DT 01-FEB-1996 (Rel. 33, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Cytochrome c oxidase polypeptide I (EC 1.9.3.1) (Cytochrome AA3 

DE subunit 1).

GN CTAD OR COXA. 

OS Rhizobium leguminosarum. 

OC Bacteria; Proteobacteria; Alphaproteobacteria; Rhizobiales;

OC Rhizobiaceae; Rhizobium/Agrobacterium group; Rhizobium.

OX NCBI_TaxID=384; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=94161496; PubMed=8117073;

RA Gabel C., Bittinger M.A., Maier R.J.;

RT "Cytochrome aa3 gene regulation in members of the family

RT Rhizobiaceae: comparison of copper and oxygen effects in

RT Bradyrhizobium japonicum and Rhizobium tropici.";

RL Appl. Environ. Microbiol. 60:141-148(1994). 

CC -!- FUNCTION: Cytochrome c oxidase is the component of the respiratory 
CC chain that catalyzes the reduction of oxygen to water. Subunits 

CC 1-3 form the functional core of the enzyme complex. CO I is the 

CC catalytic subunit of the enzyme. Electrons originating in 

CC cytochrome c are transferred via the copper A center of subunit 2 

CC and heme A of subunit 1 to the bimetallic center formed by heme A3 

CC and copper B. 

CC -!- CATALYTIC ACTIVITY: 4 ferrocytochrome c + O(2) = 4 ferricytochrome
CC c + 2 H(2)O.

CC -!- PATHWAY: Respiratory chain; terminal step. 

CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN. CONTAINS 12 
CC POTENTIAL TRANSMEMBRANE DOMAINS. 



CC -!- DEVELOPMENTAL STAGE: FREE IN SOIL (NOT AS BACTEROID).

CC -!- SIMILARITY: Belongs to the heme-copper respiratory oxidase family.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; X74341; CAA52388.1;

DR PIR; S36424; S36424.

DR HSSP; P98002; 1AR1.

DR InterPro; IPR000883; COX1.

DR Pfam; PF00115; COX1; 1.

DR PRINTS; PR01165; CYCOXIDASEI.

DR PROSITE; PS00077; COX1; 1.

KW Oxidoreductase; Heme; Copper; Transmembrane; Respiratory chain.


FT TRANSMEM 37 57 POTENTIAL.

FT TRANSMEM 84 104 POTENTIAL.

FT TRANSMEM 121 141 POTENTIAL.

FT TRANSMEM 170 190 POTENTIAL.

FT TRANSMEM 208 228 POTENTIAL.

FT TRANSMEM 259 279 POTENTIAL.

FT TRANSMEM 291 311 POTENTIAL.

FT TRANSMEM 329 349 POTENTIAL.

FT TRANSMEM 362 382 POTENTIAL.

FT TRANSMEM 401 421 POTENTIAL.

FT TRANSMEM 438 458 POTENTIAL.

FT TRANSMEM 476 496 POTENTIAL.

FT METAL 82 82 IRON (HEME A AXIAL LIGAND) (PROBABLE).

FT METAL 265 265 COPPER B (PROBABLE).

FT METAL 269 269 COPPER B (PROBABLE).

FT METAL 314 314 COPPER B (PROBABLE).

FT METAL 315 315 COPPER B (PROBABLE).

FT METAL 400 400 IRON (HEME A3 AXIAL LIGAND) (PROBABLE).

FT METAL 402 402 IRON (HEME A AXIAL LIGAND) (PROBABLE).

FT CROSSLNK 265 269 1'-histidyl-3'-tyrosine (His-Tyr)

FT (By similarity).

SQ SEQUENCE 538 AA; 59133 MW; 740C8FE6C902D9BF CRC64;



Query Match 68.3%; Score 28; DB 1; Length 538; 

Best Local Similarity 62.5%; Pred. No. 2.3e+02; 

Matches 5; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 



Qy          1 LVFFAEDF 8
              |:|| | |
Db        447 LIFFPEHF 454



RESULT 87 
YABN_ECOLI 

ID YABN_ECOLI STANDARD; PRT; 551 AA. 

AC P33595; P75638; 

DT 01-FEB-1994 (Rel. 28, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 



DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Hypothetical protein yabN. 

GN YABN OR B0069.

OS Escherichia coli.

OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales;

OC Enterobacteriaceae; Escherichia.

OX NCBI_TaxID=562;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=K12; 

RX MEDLINE=92334977; PubMed=1630901;

RA Yura T., Mori H., Nagai H., Nagata T., Ishihama A., Fujita N.,

RA Isono K., Mizobuchi K., Nakata A.; 

RT "Systematic sequencing of the Escherichia coli genome: analysis of 

RT the 0-2.4 min region."; 

RL Nucleic Acids Res. 20:3305-3308(1992).

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=K12 / MG1655; 

RX MEDLINE=97426617; PubMed=9278503;

RA Blattner F.R., Plunkett G. III, Bloch C.A., Perna N.T., Burland V.,

RA Riley M., Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F.,

RA Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A., Rose D.J.,

RA Mau B., Shao Y.;

RT "The complete genome sequence of Escherichia coli K-12.";

RL Science 277:1453-1474(1997). 

CC -!- SIMILARITY: TO E.COLI YBAE. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; D10483; BAB96638.1; -. 

DR EMBL; AE000117; AAC73180.1; -. 

DR PIR; E64728; E64728. 

DR EcoGene; EG12094; yabN. 

DR InterPro; IPR000914; SBP_bac_5.

DR Pfam; PF00496; SBP_bac_5; 1.

KW Hypothetical protein; Complete proteome.

FT CONFLICT 356 356 V -> A (IN REF. 1).

FT CONFLICT 384 384 L -> F (IN REF. 1).

FT CONFLICT 409 409 Y -> C (IN REF. 1).

SQ SEQUENCE 551 AA; 63975 MW; 0F4FAA1AAFC4E731 CRC64;

Query Match 68.3%; Score 28; DB 1; Length 551; 

Best Local Similarity 71.4%; Pred. No. 2.3e+02; 

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 LVFFAED 7
              ||:|||:
Db        365 LVYFAEE 371



RESULT 88
APB3_HUMAN

ID APB3_HUMAN STANDARD; PRT; 575 AA.

AC O96018; O60483;

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Amyloid beta A4 precursor protein-binding family A member 3 (Neuron-

DE specific X11L2 protein) (Neuronal Munc18-1-interacting protein 3)

DE (Mint-3) (Adapter protein X11gamma).

GN APBA3 OR MINT3 OR X11L2.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=99160409; PubMed=10049767;

RA Tanahashi H., Tabira T.;

RT "X11L2, a new member of X11 protein family interacts with Alzheimer's

RT beta-amyloid precursor protein."; 

RL Biochem. Biophys. Res. Commun. 255:663-667(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Lamerdin J.E., McCready P.M., Skowronski E., Viswanathan V., 

RA Burkhart-Schultz K., Brower A., Gordon L., Dias J., Ramirez M.,

RA Stilwagen S., Phan H., Velasco N., Do L., Regala W., Terry A.,

RA Games J., Danganan L., Erler A., Christensen M., Georgescu A.,

RA Avila J., Liu S., Attix C., Andreise T., Trankheim M.,

RA Amico-Keller G., Coefield J., Duarte S., Lucas S., Bruce R.,

RA Thomas P., Quan G., Kronmiller B., Arellano A., Sanders C., Ow D.,

RA Nolan M., Trong S., Kobayashi A., Olsen A.S., Carrano A.V.;

RL Submitted (NOV-1998) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE OF 472-575 FROM N.A. 

RX MEDLINE=99075474; PubMed=9860131;

RA Okamoto M., Suedhof T.C.;

RT "Mint 3: a ubiquitous mint isoform that does not bind to munc18-1 or

RT -2."; 

RL Eur. J. Cell Biol. 77:161-165(1998). 

CC -!- FUNCTION: May modulate processing of the beta-amyloid precursor 
CC protein (APP) and hence formation of beta-APP. 

CC -!- SUBUNIT: Binds to the cytoplasmic domain of amyloid protein (APP) 
CC in vivo.

CC -!- TISSUE SPECIFICITY: Expressed in all the tissues examined with 
CC lower levels in brain and testis. 

CC -!- DOMAIN: Composed of an N-terminal domain, a middle 

CC phosphotyrosine-binding domain (PID/PTB) that mediates binding 

CC with the cytoplasmic domain of the beta-amyloid precursor protein, 

CC and two C-terminal PDZ domains thought to attach proteins to the 

CC plasma membrane. 

CC -!- SIMILARITY: Contains 2 PDZ/DHR domains. 

CC -!- SIMILARITY: Contains 1 PID domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AB021638; BAA74430.1 

DR EMBL; AC005954; AAC72275.1 

DR EMBL; AF029110; AAC17979.1 

DR PIR; JG0181; JG0181. 

DR HSSP; Q02410; 1AQC. 

DR Genew; HGNC:580; APBA3.

DR MIM; 604262; -.

DR GO; GO:0016020; C:membrane; NAS.

DR GO; GO:0005515; F:protein binding; IPI.

DR GO; GO:0007242; P:intracellular signaling cascade; NAS.

DR InterPro; IPR001478; PDZ. 

DR InterPro; IPR006020; PTB_PID. 

DR Pfam; PF00595; PDZ; 2. 



DR Pfam; PF00640; PID; 1.

DR SMART; SM00228; PDZ; 2.

DR SMART; SM00462; PTB; 1.

DR PROSITE; PS50106; PDZ; 2.

DR PROSITE; PS01179; PID; 1.

KW Protein transport; Repeat; Polymorphism.

FT DOMAIN 165 171 POLY-SER.

FT DOMAIN 217 381 PID.

FT DOMAIN 394 480 PDZ 1.

FT DOMAIN 485 560 PDZ 2.

FT VARIANT 527 527 I -> F (in dbSNP:1045236).

FT /FTId=VAR_011822.

FT CONFLICT 505 505 I -> IVRPRPLAPGWGGRAALSTAPEQPPPLSRAPLFLPQ

FT (IN REF. 3).

SQ SEQUENCE 575 AA; 61454 MW; 3B910CC74C5F3840 CRC64;



Query Match 68.3%; Score 28; DB 1; Length 575; 

Best Local Similarity 83.3%; Pred. No. 2.4e+02; 

Matches 5; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy          2 VFFAED 7
              ||:|||
Db        332 VFYAED 337



RESULT 89 
DFA1_ANASP 

ID DFA1_ANASP STANDARD; PRT; 576 AA. 

AC Q8YNW5; 

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Putative diflavin flavoprotein A 1 (EC 1.-.-.-). 

GN DFA1 OR ALL4446. 

OS Anabaena sp. (strain PCC 7120).

OC Bacteria; Cyanobacteria; Nostocales; Nostocaceae; Nostoc. 

OX NCBI_TaxID=103690; 
RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=21595285; PubMed=11759840;

RA Kaneko T., Nakamura Y., Wolk C.P., Kuritz T., Sasamoto S.,

RA Watanabe A., Iriguchi M., Ishikawa A., Kawashima K., Kimura T.,

RA Kishida Y., Kohara M., Matsumoto M., Matsuno A., Muraki A.,

RA Nakazaki N., Shimpo S., Sugimoto M., Takazawa M., Yamada M.,

RA Yasuda M., Tabata S.;

RT "Complete genomic sequence of the filamentous nitrogen-fixing

RT cyanobacterium Anabaena sp. strain PCC 7120.";

RL DNA Res. 8:205-213(2001).

CC -!- FUNCTION: Mediates electron transfer from NADH to oxygen, reducing

CC it to water. This modular protein has 3 redox cofactors, in other

CC organisms the same activity requires 2 or 3 proteins (By

CC similarity).

CC -!- MISCELLANEOUS: By homology with norV in E.coli, could be involved

CC in nitric oxide detoxification (By similarity).

CC -!- SIMILARITY: In the N-terminal section; belongs to the zinc

CC metallo-hydrolase family group 3.

CC -!- SIMILARITY: In the C-terminal section; belongs to the flavodoxin

CC reductase family.

CC -!- SIMILARITY: Contains 1 flavodoxin-like domain.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AP003596; BAB76145.1; -.

DR PIR; AF2361; AF2361.

DR InterPro; IPR001279; Blactmase-like.

DR InterPro; IPR008254; Flav_nitox_synth.

DR InterPro; IPR002563; Flavin_Reduct.

DR InterPro; IPR009002; FMN_binding.

DR Pfam; PF01613; Flavin_Reduct; 1.

DR Pfam; PF00753; lactamase_B; 1.



DR PROSITE; PS50902; FLAVODOXIN_LIKE; 1.

KW Hypothetical protein; Transport; Electron transport; Oxidoreductase;

KW Flavoprotein; Metal-binding; Iron; Multigene family;

KW Complete proteome.

FT DOMAIN 48 240 ZINC METALLO-HYDROLASE.

FT DOMAIN 269 431 FLAVODOXIN-LIKE.

FT DOMAIN 432 576 FLAVODOXIN-REDUCTASE-LIKE.

FT METAL 97 97 IRON 1 (BY SIMILARITY).

FT METAL 99 99 IRON 1 (BY SIMILARITY).

FT METAL 101 101 IRON 2 (BY SIMILARITY).

FT METAL 164 164 IRON 1 (BY SIMILARITY).

FT METAL 183 183 IRON 1 AND 2 (BY SIMILARITY).

FT METAL 240 240 IRON 2 (BY SIMILARITY).

SQ SEQUENCE 576 AA; 64088 MW; AC885FEFEB8C694D CRC64;



Query Match 68.3%; Score 28; DB 1; Length 576; 

Best Local Similarity 57.1%; Pred. No. 2.4e+02; 

Matches 4; Conservative 3; Mismatches 0; Indels 0; Gaps 0;



Qy          2 VFFAEDF 8
              :|:|||:
Db        271 LFYAEDY 277



RESULT 90 
NEP2_THEVU

ID NEP2_THEVU STANDARD; PRT; 585 AA. 

AC Q08751; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 01-NOV-1995 (Rel. 32, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Neopullulanase 2 (EC 3.2.1.135) (Alpha-amylase II) (TVA II). 

GN TVAII. 

OS Thermoactinomyces vulgaris.

OC Bacteria; Firmicutes; Bacillales; Thermoactinomycetaceae;

OC Thermoactinomyces.

OX NCBI_TaxID=2026; 

RN [1] 

RP SEQUENCE FROM N.A., AND CHARACTERIZATION. 

RC STRAIN=R-47; 

RX MEDLINE=93222535; PubMed=7763540;

RA Tonozuka T., Ohtsuka M., Mogi S.-I., Sakai H., Ohta T., Sakano Y.;

RT "A neopullulanase-type alpha-amylase gene from Thermoactinomyces 

RT vulgaris R-47."; 

RL Biosci. Biotechnol. Biochem. 57:395-401(1993). 

RN [2] 

RP X-RAY CRYSTALLOGRAPHY (2.6 ANGSTROMS). 

RC STRAIN=R-47;

RX MEDLINE=99241045; PubMed=10222200;

RA Kamitori S., Kondo S., Okuyama K., Yokota T., Shimura Y., Tonozuka T.,

RA Sakano Y.;

RT "Crystal structure of Thermoactinomyces vulgaris R-47 alpha-amylase II

RT (TVAII) hydrolyzing cyclodextrins and pullulan at 2.6-A resolution.";

RL J. Mol. Biol. 287:907-921(1999). 

RN [3] 

RP X-RAY CRYSTALLOGRAPHY (2.3 ANGSTROMS). 

RX MEDLINE=22047855; PubMed=12051850; 

RA Kamitori S., Abe A., Ohtaki A., Kaji A., Tonozuka T., Sakano Y.;

RT "Crystal structures and structural comparison of Thermoactinomyces

RT vulgaris R-47 alpha-amylase 1 (TVAI) at 1.6 A resolution and

RT alpha-amylase 2 (TVAII) at 2.3 A resolution.";

RL J. Mol. Biol. 318:443-453(2002). 

CC -!- FUNCTION: Hydrolyzes pullulan efficiently but only a small amount 
CC of starch. Endohydrolysis of 1,4-alpha-glucosidic linkages in

CC pullulan to form panose. Cleaves also (1-6)-alpha-glucosidic

CC linkages to form maltotriose.

CC -!- CATALYTIC ACTIVITY: Hydrolysis of pullulan to panose (6-alpha-D-
CC glucosylmaltose).

CC -!- COFACTOR: Binds 1 calcium ion per subunit. 

CC -!- SUBUNIT: Monomer. 

CC -!- SIMILARITY: Belongs to family 13 of glycosyl hydrolases. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

DR EMBL; D13178; BAA02473.1;
DR InterPro; IPR006047; Alpha_amyl_cat.
DR InterPro; IPR004185; Glyco_hydro_13Ig.
DR InterPro; IPR007110; Ig-like.
DR Pfam; PF00128; alpha-amylase; 1.
DR Pfam; PF02903; alpha-amylase_N; 1.
FT STRAND 504 511

FT TURN 512 515 

FT STRAND 516 523 

FT TURN 524 525 

FT STRAND 526 533 

FT STRAND 539 544 

FT STRAND 553 555 

FT STRAND 561 561 

FT STRAND 564 565 

FT TURN 566 567 

FT STRAND 568 573 

FT TURN 575 576 

FT STRAND 578 582 

SQ SEQUENCE 585 AA; 67467 MW; E311813A05A7791A CRC64;



Query Match 68.3%; Score 28; DB 1; Length 585; 

Best Local Similarity 83.3%; Pred. No. 2.5e+02; 

Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          3 FFAEDF 8
              || |||
Db        268 FFIEDF 273



RESULT 91 
CRAC_DICDI 

ID CRAC_DICDI STANDARD; PRT; 697 AA. 

AC P35401; 

DT 01-JUN-1994 (Rel. 29, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Protein CRAC. 

GN DAGA. 

OS Dictyostelium discoideum (Slime mold).

OC Eukaryota; Mycetozoa; Dictyosteliida; Dictyostelium.

OX NCBI_TaxID=44689; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=AX4; 

RX MEDLINE=94375528; PubMed=8089184;

RA Insall R., Kupsa A., Lilly P.J., Shaulsky G., Levin L.R., Loomis W.F.,

RA Devreotes P.N.; 

RT "CRAC, a cytosolic protein containing a pleckstrin homology domain, 

RT is required for receptor and G protein-mediated activation of 

RT adenylyl cyclase in Dictyostelium."; 

RL J. Cell Biol. 126:1537-1545(1994). 

RN [2] 

RP SEQUENCE OF 1-8. 

RX MEDLINE=94245733; PubMed=8188693 ; 

RA Lilly P.J., Devreotes P.N.; 

RT "Identification of CRAC, a cytosolic regulator required for guanine 

RT nucleotide stimulation of adenylyl cyclase in Dictyostelium."; 

RL J. Biol. Chem. 269:14123-14129(1994). 

CC -!- FUNCTION: Couples activated G protein to adenylyl cyclase signal 
CC transduction from surface cAMP receptor. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- DEVELOPMENTAL STAGE: Tightly developmentally regulated.
CC -!- SIMILARITY: Contains 1 PH domain.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; U06228; AAA61782.1;
DR PIR; A54796; A54796.
DR DictyBase; DDB0001965; dagA.
DR InterPro; IPR001849; PH.
DR Pfam; PF00169; PH; 1.
DR SMART; SM00233; PH; 1.
DR PROSITE; PS50003; PH_DOMAIN; 1.

Query Match 68.3%; Score 28; DB 1; Length 697;

Best Local Similarity 83.3%; Pred. No. 2.9e+02; 

Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy          3 FFAEDF 8
              || |||
Db        245 FFREDF 250



RESULT 92 
YKW2_YEAST 

ID YKW2_YEAST STANDARD; PRT; 705 AA.

AC P35995; 

DT 01-JUN-1994 (Rel. 29, Created) 

DT 01-JUN-1994 (Rel. 29, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Putative 82.2 kDa transcriptional regulatory protein in FRE2 5' region. 

GN YKL222C. 

OS Saccharomyces cerevisiae (Baker's yeast). 

OC Eukaryota; Fungi; Ascomycota; Saccharomycotina; Saccharomycetes;

OC Saccharomycetales; Saccharomycetaceae; Saccharomyces.

OX NCBI_TaxID=4932; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=94378726; PubMed=8091865;

RA Alexandraki D., Tzermia M.;

RT "Sequencing of a 13.2 kb segment next to the left telomere of yeast 

RT chromosome XI revealed five open reading frames and recent 

RT recombination events with the right arms of chromosomes III and V."; 

RL Yeast 10:S81-S91(1994).

CC -!- SUBCELLULAR LOCATION: Nuclear (Probable). 

CC -!- SIMILARITY: Contains 1 Zn(2)-Cys(6) fungal-type binuclear cluster 
CC domain.

CC

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X75950; CAA53551.1; -. 

DR EMBL; Z28222; CAA82067.1; -. 

DR PIR; S38066; S38066. 

DR HSSP; P12351; 1PYC. 

DR GermOnline; 139976; -. 

DR SGD; S0001705; YKL222C. 

DR InterPro; IPR007219; Fungal_trans.

DR InterPro; IPR001138; Fungi_TrN.

DR Pfam; PF04082; Fungal_trans; 1.

DR Pfam; PF00172; Zn_clus; 1.

DR PRINTS; PR00054; FUNGALZNCYS.

DR SMART; SM00066; GAL4; 1.

DR PROSITE; PS00463; ZN2_CY6_FUNGAL_1; 1.

DR PROSITE; PS50048; ZN2_CY6_FUNGAL_2; 1.

KW Hypothetical protein; Transcription regulation; DNA-binding; 

KW Nuclear protein; Zinc; Metal-binding. 

FT DNA_BIND 24 52 ZN(2)-CYS(6), FUNGAL-TYPE.

SQ SEQUENCE 705 AA; 82248 MW; 31BF148717D72E1C CRC64; 



Query Match 68.3%; Score 28; DB 1; Length 705;

Best Local Similarity 83.3%; Pred. No. 3e+02; 

Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy   3 FFAEDF 8
       ||| ||
Db 201 FFASDF 206



RESULT 93 
VIV_ORYSA

ID VIV_ORYSA STANDARD; PRT; 728 AA. 

AC P37398; 

DT 01-OCT-1994 (Rel. 30, Created) 

DT 01-OCT-1994 (Rel. 30, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Viviparous protein homolog. 

GN VP1. 

OS Oryza sativa (Rice).

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 

OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; 

OC Ehrhartoideae; Oryzeae; Oryza.

OX NCBI_TaxID=4530; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=94250843; PubMed=8193305;

RA Hattori T., Terada T., Hamasuna S.; 

RT "Sequence and functional analyses of the rice gene homologous to the

RT maize Vp1.";



RL Plant Mol. Biol. 24:805-810(1994). 

CC -!- FUNCTION: Could participate in abscisic acid-regulated gene 
CC expression during seed development. 

CC -!- SIMILARITY: Contains 1 TF-B3 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; D16640; BAA04066.1; -. 

DR TRANSFAC; T04785; -.

DR Gramene; P37398; -. 

DR InterPro; IPR003340; TF_B3.

DR Pfam; PF02362; B3; 1. 

KW Developmental protein; Transcription regulation; DNA-binding; 

KW Activator.

FT DOMAIN 500 685 TF-B3.

SQ SEQUENCE 728 AA; 75982 MW; F66882079F5FE428 CRC64; 

Query Match 68.3%; Score 28; DB 1; Length 728;

Best Local Similarity 83.3%; Pred. No. 3e+02; 

Matches 5; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy  2 VFFAED 7
      ||||:|
Db 46 VFFADD 51



RESULT 94 
YNN2_YEAST 

ID YNN2_YEAST STANDARD; PRT; 1056 AA. 

AC P53914; 

DT 01-OCT-1996 (Rel. 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Hypothetical UPF0202 protein YNL132W. 

GN YNL132W OR N1216 OR N1858. 

OS Saccharomyces cerevisiae (Baker's yeast). 

OC Eukaryota; Fungi; Ascomycota; Saccharomycotina; Saccharomycetes;

OC Saccharomycetales; Saccharomycetaceae; Saccharomyces.

OX NCBI_TaxID=4932;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=S288c; 

RX MEDLINE=96109932; PubMed=8619318;

RA Mallet L., Bussereau F., Jacquet M.;

RT "A 43.5 kb segment of yeast chromosome XIV, which contains MFA2, 

RT MEP2, CAP/SRV2, NAM9, FKB1/FPR1/RBP1, MOM22 and CPT1, predicts an 

RT adenosine deaminase gene and 14 new open reading frames."; 

RL Yeast 11:1195-1209(1995).

CC -!- SUBCELLULAR LOCATION: Nuclear (By similarity). 

CC -!- SIMILARITY: Belongs to the UPF0202 family.

CC



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; Z46843; CAA86893.1; 

DR EMBL; Z71408; CAA96014.1; 

DR PIR; S55151; S55151. 

DR GermOnline; 143138; 

DR SGD; S0005076; YNL132W. 

DR InterPro; IPR007807; DUF699. 

DR Pfam; PF05127; DUF699; 1. 

KW Hypothetical protein; ATP-binding; Nuclear protein. 

FT NP_BIND 283 290 ATP (POTENTIAL).

SQ SEQUENCE 1056 AA; 119347 MW; 76721ED0867ED618 CRC64; 

Query Match 68.3%; Score 28; DB 1; Length 1056; 

Best Local Similarity 75.0%; Pred. No. 4.4e+02; 

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy   1 LVFFAEDF 8
       || ||:||
Db 774 LVEFAKDF 781



RESULT 95 
CYA9_HUMAN 

ID CYA9_HUMAN STANDARD; PRT; 1353 AA. 

AC O60503; O60273; Q9BWT4; Q9UGP2;

DT 15-JUL-1999 (Rel. 38, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Adenylate cyclase, type IX (EC 4.6.1.1) (ATP pyrophosphate-lyase) 

DE (Adenylyl cyclase).

GN ADCY9 OR KIAA0520.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Heart; 

RX MEDLINE=98292499; PubMed=9628827;

RA Hacker B.M., Tomlinson J.E., Wayman G.A., Sultana R., Chan G.,

RA Villacres E., Disteche C., Storm D.R.;

RT "Cloning, chromosomal mapping, and regulatory properties of the human

RT type 9 adenylyl cyclase (ADCY9).";

RL Genomics 50:97-104(1998). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Paterson J.M., Smith S.M., Simpson J., Grace O.C., Bell J.E., 

RA Antoni F.A.; 

RT "Cloning and characterisation of human adenylyl cyclase IX: 

RT differential mRNA regulation and inhibition by Ca2+/calcineurin.";



RL Submitted (FEB-1999) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP SEQUENCE FROM N.A.

RA Toyota T., Yamada K., Meerabux J., Hattori E., Saito K.,

RA Yoshitsugu K., Shimizu H., Nankai M., Toru M., Detera-Wadleigh S.D.,

RA Yoshikawa T.; 

RT "Mutation screening, case control study and transmission 

RT disequilibrium analysis of adenylate cyclase type 9 (ADCY9) gene in 

RT functional psychoses."; 

RL Submitted (MAR-2001) to the EMBL/GenBank/DDBJ databases.

RN [4] 

RP SEQUENCE OF 141-1353 FROM N.A., AND REVISION TO 1154.

RC TISSUE=Brain; 

RX MEDLINE=22158633; PubMed=12168954;

RA Nakajima D., Okazaki N., Yamakawa H., Kikuno R., Ohara O., Nagase T.;

RT "Construction of expression-ready cDNA clones for KIAA genes: manual 

RT curation of 330 KIAA cDNA clones."; 

RL DNA Res. 9:99-106(2002). 

RN [5] 

RP SEQUENCE OF 788-1353 FROM N.A. 

RC TISSUE=Brain;

RX MEDLINE=98290545; PubMed=9628581;

RA Nagase T., Ishikawa K.-I., Miyajima N., Tanaka A., Kotani H.,

RA Nomura N., Ohara O.;

RT "Prediction of the coding sequences of unidentified human genes. IX. 

RT The complete sequences of 100 new cDNA clones from brain which can 

RT code for large proteins in vitro."; 

RL DNA Res. 5:31-39(1998). 

CC -!- FUNCTION: May play a fundamental role in situations where fine 
CC interplay between intracellular calcium and cAMP determines the 

CC cellular function. May be a physiologically relevant docking site 

CC for calcineurin (By similarity).

CC -!- CATALYTIC ACTIVITY: ATP = 3',5'-cyclic AMP + diphosphate.

CC -!- COFACTOR: Binds 2 magnesium ions per subunit. 

CC -!- ENZYME REGULATION: Insensitive to calcium/calmodulin, forskolin 
CC and somatostatin. Stimulated by beta-adrenergic receptor 

CC activation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- DOMAIN: Composed of two homologous domains. 

CC -!- SIMILARITY: Belongs to the adenylyl cyclase class-4/guanylyl
CC cyclase family. 

CC -!- SIMILARITY: Contains 1 FKBP-type PPIase domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

DR EMBL; AF036927; AAC24201.1; -. 

DR EMBL; AJ133123; CAB65084.1; -. 

DR EMBL; AY028959; AAK29464.1; -. 

DR EMBL; AY028949; AAK29464.1; JOINED. 

DR EMBL; AY028950; AAK29464.1; JOINED. 

DR EMBL; AY028951; AAK29464.1; JOINED.
G -> R (IN REF. 2).
N -> S (IN REF. 2 AND 4).


FT
QHQLSISPDIRVQVDGSIGRSPTDEIANLVPSVQYVDKTSL

FT GSDSSTQAKDAHLSPKRPWKEPVKAEERGRFGKAIEKDDCD

FT ETGIEEANELTKLNVSKSV -> APAVHLPRHPRPGGWQHR

FT TVSHRRDCQPGAFCPVCGQDISGF (IN REF. 1).

FT CONFLICT 1308 1308 P -> R (IN REF. 2). 

SQ SEQUENCE 1353 AA; 150699 MW; 4CBF051EA49B5B7B CRC64; 

Query Match 68.3%; Score 28; DB 1; Length 1353; 
Best Local Similarity 71.4%; Pred. No. 5.7e+02; 

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy   1 LVFFAED 7
       :||| ||
Db 841 MVFFLED 847

RESULT 96 
CYA9_MOUSE 

ID CYA9_MOUSE STANDARD; PRT; 1353 AA. 

AC P51830; Q61279; 

DT 01-OCT-1996 (Rel. 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Adenylate cyclase, type IX (EC 4.6.1.1) (ATP pyrophosphate-lyase) 

DE (Adenylyl cyclase) (Adenylyl cyclase type 10) (ACTP10).

GN ADCY9.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; TISSUE=Brain; 

RX MEDLINE=96278831; PubMed=8662814;

RA Premont R.T., Matsuoka I., Mattei M.-G., Pouille Y., Defer N.,

RA Hanoune J.;

RT "Identification and characterization of a widely expressed form of 

RT adenylyl cyclase."; 

RL J. Biol. Chem. 271:13900-13907(1996). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96024597; PubMed=7575502;

RA Paterson J.M., Smith S.M., Harmar A.J., Antoni F.A.;

RT "Control of a novel adenylyl cyclase by calcineurin.";

RL Biochem. Biophys. Res. Commun. 214:1000-1008(1995). 

RN [3] 

RP SEQUENCE OF 1106-1193 FROM N.A. 

RX MEDLINE=95097788; PubMed=7528319;

RA Premont R.T.;

RT "Identification of adenylyl cyclases by amplification using

RT degenerate primers.";

RL Meth. Enzymol. 238:116-127(1994). 

CC -!- FUNCTION: May play a fundamental role in situations where fine 

CC interplay between intracellular calcium and cAMP determines the 

CC cellular function. May be a physiologically relevant docking site 

CC for calcineurin. 

CC -!- CATALYTIC ACTIVITY: ATP = 3',5'-cyclic AMP + diphosphate.

CC -!- COFACTOR: Binds 2 magnesium ions per subunit. 



CC -!- ENZYME REGULATION: Insensitive to calcium/calmodulin. Stimulated
CC by magnesium, forskolin and mutationally activated G protein (GS)- 

CC alpha. Regulated by calcineurin. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein.

CC -!- TISSUE SPECIFICITY: Found in decreasing order in skeletal muscle, 
CC heart, adrenal gland, ovary and brain; and to a lesser extent, in 

CC kidney, liver, testis, lung, thymus and spleen. 

CC -!- DOMAIN: Composed of two homologous domains. 

CC -!- SIMILARITY: Belongs to the adenylyl cyclase class-4/guanylyl 
CC cyclase family. 

CC -!- SIMILARITY: Contains 1 FKBP-type PPIase domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U30602; AAC52603.1; -. 

DR EMBL; Z50190; CAA90570.1; 

DR PIR; JC4279; JC4279. 

DR HSSP; P26769; 1AB8.

DR MGD; MGI:108450; Adcy9.

DR InterPro; IPR001054; G_cyclase. 

DR Pfam; PF00211; guanylate_cyc; 2. 

DR SMART; SM00044; CYCc; 2. 

DR PROSITE; PS00452; GUANYLATE_CYCLASES_1; 1.

DR PROSITE; PS50125; GUANYLATE_CYCLASES_2; 2.

KW Lyase; cAMP biosynthesis; Transmembrane; Glycoprotein; Repeat; 

KW Metal-binding; Magnesium; Isomerase; Rotamase.

Query Match 68.3%; Score 28; DB 1; Length 1353;

Best Local Similarity 71.4%; Pred. No. 5.7e+02;

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy   1 LVFFAED 7
       :||| ||
Db 841 MVFFLED 847



RESULT 97
BGS2_SCHPO

ID BGS2_SCHPO STANDARD; PRT; 1894 AA.

AC O13967; Q96WS3;

DT 28-FEB-2003 (Rel. 41, Created)

DT 28-FEB-2003 (Rel. 41, Last sequence update)

DT 28-FEB-2003 (Rel. 41, Last annotation update)

DE 1,3-beta-glucan synthase component bgs2 (EC 2.4.1.34) (1,3-beta-D-

DE glucan-UDP glucosyltransferase) (Meiotic expression upregulated

DE protein 21).

GN BGS2 OR MEU21 OR SPAC24C9.07C.

OS Schizosaccharomyces pombe (Fission yeast).

OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes;

OC Schizosaccharomycetales; Schizosaccharomycetaceae;

OC Schizosaccharomyces.

OX NCBI_TaxID=4896;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=972;

RX MEDLINE=21848401; PubMed=11859360;

RA Wood V., Gwilliam R., Rajandream M.A., Lyne M., Lyne R., Stewart A.,

RA Sgouros J., Peat N., Hayles J., Baker S., Basham D., Bowman S.,

RA Brooks K., Brown D., Brown S., Chillingworth T., Churcher C.M.,

RA Collins M., Connor R., Cronin A., Davis P., Feltwell T., Fraser A.,

RA Gentles S., Goble A., Hamlin N., Harris D., Hidalgo J., Hodgson G.,

RA Holroyd S., Hornsby T., Howarth S., Huckle E.J., Hunt S., Jagels K.,

RA James K., Jones L., Jones M., Leather S., McDonald S., McLean J.,

RA Mooney P., Moule S., Mungall K., Murphy L., Niblett D., Odell C.,

RA Oliver K., O'Neil S., Pearson D., Quail M.A., Rabbinowitsch E.,

RA Rutherford K., Rutter S., Saunders D., Seeger K., Sharp S.,

RA Skelton J., Simmonds M., Squares R., Squares S., Stevens K.,

RA Taylor K., Taylor R.G., Tivey A., Walsh S.V., Warren T., Whitehead

RA Woodward J., Volckaert G., Aert R., Robben J., Grymonprez B.,

RA Weltjens I., Vanstreels E., Rieger M., Schaefer M., Mueller-Auer S.,

RA Gabel C., Fuchs M., Fritzc C., Holzer E., Moestl D., Hilbert H.,

RA Borzym K., Langer I., Beck A., Lehrach H., Reinhardt R., Pohl T.M.,



RA Eger P., Zimmermann W., Wedler H., Wambutt R., Purnelle B., 

RA Goffeau A., Cadieu E., Dreano S., Gloux S., Lelaure V., Mottier S.,

RA Galibert F., Aves S.J., Xiang Z., Hunt C., Moore K., Hurst S.M.,

RA Lucas M., Rochet M., Gaillardin C., Tallada V.A., Garzon A., Thode G.,

RA Daga R.R., Cruzado L., Jimenez J., Sanchez M., del Rey F., Benito J.,

RA Dominguez A., Revuelta J.L., Moreno S., Armstrong J., Forsburg S.L., 

RA Cerrutti L., Lowe T., McCombie W.R., Paulsen I., Potashkin J., 

RA Shpakovski G.V., Ussery D., Barrell B.G., Nurse P.; 

RT "The genome sequence of Schizosaccharomyces pombe."; 

RL Nature 415:871-880(2002). 

RN [2] 

RP SEQUENCE OF 1717-1894 FROM N.A.

RC STRAIN=CD16-1;

RX MEDLINE=21270454; PubMed=11376151;

RA Watanabe T., Miyashita K., Saito T.T., Yoneki T., Kakihara Y.,

RA Nabeshima K., Kishi Y.A., Shimoda C., Nojima H.;

RT "Comprehensive isolation of meiosis-specific genes identifies novel

RT proteins and unusual non-coding transcripts in Schizosaccharomyces 

RT pombe."; 

RL Nucleic Acids Res. 29:2327-2337(2001). 

RN [3] 

RP FUNCTION, SUBCELLULAR LOCATION, AND INDUCTION. 

RX MEDLINE=20521933; PubMed=11069657;

RA Martin V., Ribas J.C., Carnero E., Duran A., Sanchez Y.;

RT "bgs2+, a sporulation-specific glucan synthase homologue is required

RT for proper ascospore wall maturation in fission yeast."; 

RL Mol. Microbiol. 38:308-321(2000). 

RN [4] 

RP FUNCTION, AND SUBCELLULAR LOCATION. 

RX MEDLINE=20382733; PubMed=10922478;

RA Liu J., Tang X., Wang H., Balasubramanian M.;

RT "Bgs2p, a 1,3-beta-glucan synthase subunit, is essential for

RT maturation of ascospore wall in Schizosaccharomyces pombe."; 

RL FEBS Lett. 478:105-108(2000). 

CC -!- FUNCTION: Has a role in ascospore development where it is required 
CC for the assembly of a functional spore wall. 

CC -!- CATALYTIC ACTIVITY: UDP-glucose + {(1,3)-beta-D-glucosyl}(N) = UDP
CC + {(1,3)-beta-D-glucosyl}(N+1).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Associated with 
CC the forespore inner membrane. 

CC -!- INDUCTION: By sporulation. 

CC -!- SIMILARITY: Belongs to the glycosyltransferase family 48.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; Z98601; CAB11264.2; -. 

DR EMBL; AB054305; BAB60872.1; -. 

DR GeneDB_SPombe; SPAC24C9.07c; -.

DR InterPro; IPR003440; Glyco_trans_48.

DR Pfam; PF02364; Glucan_synthase; 1. 

KW Transmembrane; Transferase; Glycosyltransferase; Meiosis; Sporulation.

SQ SEQUENCE 1894 AA; 219181 MW; 2DBA28A471AAED21 CRC64;

Query Match 68.3%; Score 28; DB 1; Length 1894;

Best Local Similarity 75.0%; Pred. No. 7.9e+02;

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy    1 LVFFAEDF 8
        | ||| ||
Db 1772 LNFFAADF 1779



RESULT 98 
CA36_CHICK 

ID CA36_CHICK STANDARD; PRT; 3137 AA. 

AC P15989; 

DT 01-APR-1990 (Rel. 14, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Collagen alpha 3(VI) chain precursor.

GN COL6A3.

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;

OC Gallus.

OX NCBI_TaxID=9031;

RN [1] 

RP SEQUENCE OF 1-853 FROM N.A. 

RC TISSUE=Aorta;

RX MEDLINE=91035630; PubMed=1977751;

RA Doliana R., Bonaldo P., Colombatti A.;

RT "Multiple forms of chicken alpha 3(VI) collagen chain generated by

RT alternative splicing in type A repeated domains.";

RL J. Cell Biol. 111:2197-2205(1990).

RN [2] 

RP SEQUENCE OF 224-2871 FROM N.A. 

RX MEDLINE=90212613; PubMed=2322559;

RA Bonaldo P., Russo V., Bucciotti F., Doliana R., Colombatti A.;

RT "Structural and functional features of the alpha 3 chain indicate a 

RT bridging role for chicken collagen VI in connective tissues."; 

RL Biochemistry 29:1245-1254(1990). 

RN [3] 

RP SEQUENCE OF 2871-3137 FROM N.A. 

RX MEDLINE=90062147; PubMed=2584214;

RA Bonaldo P., Colombatti A.; 

RT "The carboxyl terminus of the chicken alpha 3 chain of collagen VI is

RT a unique mosaic structure with glycoprotein Ib-like, fibronectin type

RT III, and Kunitz modules."; 

RL J. Biol. Chem. 264:20235-20239(1989). 

CC -!- FUNCTION: Collagen VI acts as a cell-binding protein. 

CC -!- SUBUNIT: Trimers composed of three different chains: alpha 1(VI),

CC alpha 2(VI), and alpha 3(VI).

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=1;

CC Comment=At least 2 isoforms are produced;

CC Name=1;

CC IsoId=P15989-1; Sequence=Displayed;

CC -!- PTM: Prolines at the third position of the tripeptide repeating 
CC unit (G-X-Y) are hydroxylated in some or all of the chains. 

CC -!- SIMILARITY: Contains 1 BPTI/Kunitz inhibitor domain. 

CC -!- SIMILARITY: Contains 1 fibronectin type III domain. 

CC -!- SIMILARITY: Contains 12 VWFA domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; M24282; AAA03201.1; -. 

DR PIR; A37797; A37797. 

DR HSSP; P12111; 2KNT. 

DR InterPro; IPR008161; Clg_helix. 

DR InterPro; IPR008160; Collagen. 

DR InterPro; IPR003961; FN_III.

DR InterPro; IPR002223; Kunitz_BPTI.

DR InterPro; IPR002035; VWF_A.

DR Pfam; PF01391; Collagen; 6.

DR Pfam; PF00041; fn3; 1.

DR Pfam; PF00014; Kunitz_BPTI; 1. 

DR Pfam; PF00092; vwa; 11. 

DR PRINTS; PR00759; BASICPTASE. 

DR PRINTS; PR00453; VWFADOMAIN. 

DR ProDom; PD000007; Clg_helix; 1. 

DR ProDom; PD000222; Kunitz_BPTI; 1. 

DR SMART; SM00060; FN3; 1. 

DR SMART; SM00131; KU; 1. 

DR SMART; SM00327; VWA; 12. 

DR PROSITE; PS00280; BPTI_KUNITZ_1; 1.

DR PROSITE; PS50279; BPTI_KUNITZ_2; 1.

DR PROSITE; PS50234; VWFA; 12. 

KW Extracellular matrix; Connective tissue; Repeat; Hydroxylation; 

KW Glycoprotein; Cell adhesion; Serine protease inhibitor; Collagen; 

KW Signal; Alternative splicing.
) (POTENTIAL).

SQ SEQUENCE 3137 AA; 339595 MW; ECB428578B536357 CRC64;

Query Match 68.3%; Score 28; DB 1; Length 3137;

Best Local Similarity 50.0%; Pred. No. 1.3e+03;

Matches 4; Conservative 3; Mismatches 1; Indels 0;

Qy   1 LVFFAEDF 8
       :|:| :||
Db 800 MVYFMDDF 807


RESULT 99 
MIH_CANPG 

ID MIH_CANPG STANDARD; PRT; 78 AA.

AC P55846; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Molt-inhibiting hormone (MIH).

OS Cancer pagurus (Rock crab).

OC Eukaryota; Metazoa; Arthropoda; Crustacea; Malacostraca; 

OC Eumalacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura; 

OC Eubrachyura; Cancroidea; Cancridae; Cancer. 

OX NCBI_TaxID=6755; 

RN [1] 

RP SEQUENCE. 

RC TISSUE=Sinus gland;

RX MEDLINE=97021946; PubMed=8868306;

RA Chung J.S., Wilkinson M.C., Webster S.G.;

RT "Determination of the amino acid sequence of the moult-inhibiting 

RT hormone from the edible crab, Cancer pagurus."; 

RL Neuropeptides 30:95-101(1996). 

CC -!- FUNCTION: Inhibits Y-organs where molting hormone (ecdysteroid)
CC is secreted. A molting cycle is initiated when MIH secretion



Q7X225

ID Q7X225 PRELIMINARY; PRT; 261 AA.

AC Q7X225;

DT 01-OCT-2003 (TrEMBLrel. 25, Created)

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE GTP pyrophosphokinase.

GN YIBM.

OS Staphylococcus aureus (strain Mu50 / ATCC 700699).

OC Bacteria; Firmicutes; Bacillales; Staphylococcus.

OX NCBI_TaxID=158878;

RN [1]

RP SEQUENCE FROM N.A.

RA Wootton M., Avison M.B., Bennett P.M., Howe R.A., MacGowan A.P.,

RA Walsh T.R.;

RT "Genetic analysis of seventeen genes in Staphylococcus aureus with

RT reduced susceptibility to vancomycin (VRSA) and hetero-VRSA (hVRSA).";

RL Submitted (MAY-2003) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AJ564070; CAD91854.1;

KW Kinase.

SQ SEQUENCE 261 AA; 30592 MW; CE58B6DE0F263166 CRC64;

Query Match 78.0%; Score 32; DB 2; Length 261;

Best Local Similarity 85.7%; Pred. No. 1.4e+02;

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAED 7
       ||||:||
Db 158 LVFFSED 164



RESULT 41 
Q7WRM0 

ID Q7WRM0 PRELIMINARY; PRT; 261 AA.

AC Q7WRM0;

DT 01-OCT-2003 (TrEMBLrel. 25, Created)

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE GTP pyrophosphokinase.

GN YIBM.

OS Staphylococcus aureus.

OC Bacteria; Firmicutes; Bacillales; Staphylococcus.

OX NCBI_TaxID=1280;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=Mu3, Michigan, New Jersey, Glasgow3700, Glasgow3759, PC3, St

RC Luke, Sweden307, LIM2, LIM3, PC1, Sweden309, FranceDuf, Slovenia6096,

RC Southampton23, LIM1, Norway1018, SMH2, LiverpoolAG, LLA, LLE,

RC SMH11888, SMH18000, SMH18034, SMH18037, SMH12248, SMH8997, SMH10501,

RC SMH17608, SMH17487, and SMH14017;

RA Wootton M., Avison M.B., Bennett P.M., Howe R.A., MacGowan A.P.,

RA Walsh T.R.;

RT "Genetic analysis of seventeen genes in Staphylococcus aureus with

RT reduced susceptibility to vancomycin (VRSA) and hetero-VRSA (hVRSA).";

RL Submitted (MAY-2003) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AJ564071; CAD91855.1; -.

DR EMBL; AJ564072; CAD91856.1;

DR EMBL; AJ564097;

DR EMBL; AJ564098; CAD91882.1;

DR EMBL; CAD91883.

DR EMBL; AJ564100; 1;

DR EMBL;

KW Kinase.

SQ SEQUENCE 261 AA; 30592 MW; CE58B6DE0F263166 CRC64;

Query Match 78.0%; Score 32; DB 2; Length 261;

Best Local Similarity 85.7%; Pred. No. 1.4e+02;

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAED 7
       ||||:||
Db 158 LVFFSED 164



RESULT 42 
Q99V89

ID Q99V89 PRELIMINARY; PRT; 261 AA.

AC Q99V89;

DT 01-JUN-2001 (TrEMBLrel. 17, Created)

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE Hypothetical protein SA0860.

GN SA0860.

OS Staphylococcus aureus (strain N315).

OC Bacteria; Firmicutes; Bacillales; Staphylococcus. 

OX NCBI_TaxID=158879; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21311952; PubMed=11418146;

RA Kuroda M., Ohta T., Uchiyama I., Baba T., Yuzawa H., Kobayashi I.,

RA Cui L., Oguchi A., Aoki K.I., Nagai Y., Lian J., Ito T., Kanamori M.,

RA Matsumaru H., Maruyama A., Murakami H., Hosoyama A., Mizutani-Ui Y.,

RA Takahashi N.K., Sawano T., Inoue R.I., Kaito C., Sekimizu K.,

RA Hirakawa H., Kuhara S., Goto S., Yabuzaki J., Kanehisa M.,

RA Yamashita A., Oshima K., Furuya K., Yoshino C., Shiba T., Hattori M.,

RA Ogasawara N., Hayashi H., Hiramatsu K.;

RT "Whole genome sequencing of meticillin-resistant Staphylococcus

RT aureus.";

RL Lancet 357:1225-1240(2001). 

DR EMBL; AP003132; BAB42101.1; 

DR PIR; B89868; B89868. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 261 AA; 30592 MW; CE58B6DE0F263166 CRC64; 

Query Match 78.0%; Score 32; DB 16; Length 261; 

Best Local Similarity 85.7%; Pred. No. 1.4e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 LVFFAED 7
       ||||:||
Db 158 LVFFSED 164



RESULT 43 
Q8NXD0 

ID Q8NXD0 PRELIMINARY; PRT; 268 AA. 

AC Q8NXD0; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Hypothetical protein MW0883. 

GN MW0883. 

OS Staphylococcus aureus (strain MW2).

OC Bacteria; Firmicutes; Bacillales; Staphylococcus. 

OX NCBI_TaxID=196620; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22040717; PubMed=12044378;

RA Baba T., Takeuchi F., Kuroda M., Yuzawa H., Aoki K.-I., Oguchi A.,

RA Nagai Y., Iwama N., Asano K., Naimi T., Kuroda H., Cui L.,

RA Yamamoto K., Hiramatsu K.;

RT "Genome and virulence determinants of high virulence community-

RT acquired MRSA.";

RL Lancet 359:1819-1827(2002).

DR EMBL; AP004825; BAB94748.1; -. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 268 AA; 31363 MW; F40B877860BD1140 CRC64;

Query Match 78.0%; Score 32; DB 16; Length 268; 

Best Local Similarity 85.7%; Pred. No. 1.5e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 LVFFAED 7
       ||||:||
Db 165 LVFFSED 171



RESULT 44
Q83NF3 

ID Q83NF3 PRELIMINARY; PRT; 379 AA. 

AC Q83NF3; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE Putative integral membrane transport protein.

GN TW575.

OS Tropheryma whipplei (strain TW08/27) (Whipple's bacillus).

OC Bacteria; Actinobacteria; Actinobacteridae; Actinomycetales;

OC Micrococcineae; Cellulomonadaceae; Tropheryma. 

OX NCBI_TaxID=218496; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22495039; PubMed=12606174;

RA Bentley S.D., Maiwald M., Murphy L.D., Pallen M.J., Yeats C.A.,

RA Dover L.G., Norbertczak H.T., Besra G.S., Quail M.A., Harris D.E.,

RA von Herbay A., Goble A., Rutter S., Squares R., Squares S.,

RA Barrell B.G., Parkhill J., Relman D.A.;

RT "Sequencing and analysis of the genome of the Whipple's disease

RT bacterium Tropheryma whipplei."; 

RL Lancet 361:637-644(2003). 

DR EMBL; BX251412; CAD67241.1; -. 

DR GO; GO:0016020; C:membrane; IEA.

DR GO; GO:0005215; F:transporter activity; IEA.

DR GO; GO:0006810; P:transport; IEA.

DR InterPro; IPR007114; MFS.

DR InterPro; IPR005828; Sub_transporter.

DR Pfam; PF00083; sugar_tr; 1. 

DR PROSITE; PS50850; MFS; 1. 

KW Complete proteome. 

SQ SEQUENCE 379 AA; 39462 MW; 0E0F9053BACCADF2 CRC64; 



Query Match 78.0%; Score 32; DB 16; Length 379; 

Best Local Similarity 75.0%; Pred. No. 2.1e+02; 

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy   1 LVFFAEDF 8
       |||||| :
Db 274 LVFFAESY 281



RESULT 45 
Q83N16 

ID Q83N16 PRELIMINARY; PRT; 390 AA. 

AC Q83N16; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Multidrug efflux protein. 

GN QACA OR TWT197. 

OS Tropheryma whipplei (strain Twist) (Whipple's bacillus). 

OC Bacteria; Actinobacteria; Actinobacteridae; Actinomycetales; 

OC Micrococcineae; Cellulomonadaceae; Tropheryma. 

OX NCBI_TaxID=203267;

RN [1] 



RP SEQUENCE FROM N.A. 

RA Raoult D., Audic S., Robert C., Ogata H., Suhre K., Drancourt M.,

RA Claverie J.-M.; 

RT "Tropheryma whipplei illustrates the diversity of gene loss patterns 

RT in small genome bacterial pathogens."; 

RL Submitted (JUL-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AE016850; AAO44294.1; -.

DR GO; GO:0016020; C:membrane; IEA.

DR GO; GO:0005215; F:transporter activity; IEA.

DR GO; GO:0006810; P:transport; IEA.

DR InterPro; IPR007114; MFS.

DR InterPro; IPR005828; Sub_transporter.

DR Pfam; PF00083; sugar_tr; 1.

DR PROSITE; PS50850; MFS; 1.

KW Complete proteome.

SQ SEQUENCE 390 AA; 40609 MW; 099B4A82C5AAA8B2 CRC64; 



Query Match 78.0%; Score 32; DB 16; Length 390;

Best Local Similarity 75.0%; Pred. No. 2.1e+02;

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy   1 LVFFAEDF 8
       |||||| :
Db 285 LVFFAESY 292



RESULT 46
O87160

ID O87160 PRELIMINARY; PRT; 398 AA.

AC O87160;

DT 01-NOV-1998 (TrEMBLrel. 08, Created)

DT 01-NOV-1998 (TrEMBLrel. 08, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE WblD protein.

GN WBLD.

OS Vibrio cholerae.

OC Bacteria; Proteobacteria; Gammaproteobacteria; Vibrionales;

OC Vibrionaceae; Vibrio.

OX NCBI_TaxID=666;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=O22;

RX MEDLINE=99453293; PubMed=10521656;

RA Yamasaki S., Shimizu T., Hoshino K., Ho S.-T., Shimada T., Nair G.B.,

RA Takeda Y.;

RT "The genes responsible for O-antigen synthesis of Vibrio cholerae O139

RT are closely related to those of Vibrio cholerae O22.";

RL Gene 237:321-332(1999).

DR EMBL; AB012957; BAA33635.1; -.

DR PIR; T44331; T44331.

SQ SEQUENCE 398 AA; 47080 MW; 7A1998E8C402BB74 CRC64;

Query Match 78.0%; Score 32; DB 2; Length 398;

Best Local Similarity 85.7%; Pred. No. 2.2e+02;

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy   2 VFFAEDF 8
       |||| ||
Db 264 VFFARDF 270



RESULT 47 
Q7ULM6 



ID Q7ULM6 PRELIMINARY; PRT; 656 AA.
AC Q7ULM6;
DT 01-OCT-2003 (TrEMBLrel. 25, Created)
DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Hypothetical protein.
GN RB9405.
OS Rhodopirellula baltica.
OC Bacteria; Planctomycetes; Planctomycetacia; Planctomycetales;
OC Planctomycetaceae; Pirellula.
OX NCBI_TaxID=117;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=1;
RX MEDLINE=22735913; PubMed=12835416;
RA Gloeckner F.O., Kube M., Bauer M., Teeling H., Lombardot T.,
RA Ludwig W., Gade D., Beck A., Borzym K., Heitmann K., Rabus R.,
RA Schlesner H., Amann R., Reinhardt R.;
RT "Complete genome sequence of the marine planctomycete Pirellula sp.
RT strain 1.";
RL Proc. Natl. Acad. Sci. U.S.A. 100:8298-8303(2003).
DR EMBL; BX294149; CAD76243.1; -.
KW Hypothetical protein; Complete proteome.
SQ SEQUENCE 656 AA; 75050 MW; 977C8888D2510F02 CRC64;

Query Match 78.0%; Score 32; DB 16; Length 656;
Best Local Similarity 75.0%; Pred. No. 3.6e+02;
Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy   1 LVFFAEDF 8
       | ||| ||
Db 530 LAFFASDF 537



RESULT 48 
Q8A6R7 

ID Q8A6R7 PRELIMINARY; PRT; 1676 AA. 

AC Q8A6R7;

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Conserved protein, with weak BamHI domain. 

GN BT1809. 

OS Bacteroides thetaiotaomicron.

OC Bacteria; Bacteroidetes; Bacteroides (class); Bacteroidales;

OC Bacteroidaceae; Bacteroides. 

OX NCBI_TaxID=818; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=VPI-5482 / ATCC 29148; 

RX MEDLINE=22550858; PubMed=12663928;

RA Xu J., Bjursell M.K., Himrod J., Deng S., Carmichael L.K.,

RA Chiang H.C., Hooper L.V., Gordon J.I.;

RT "A genomic view of the human-Bacteroides thetaiotaomicron symbiosis."; 

RL Science 299:2074-2076(2003). 

DR EMBL; AE016933; AAO76916.1; -.

DR GO; GO:0005509; F:calcium ion binding; IEA.

DR InterPro; IPR008938; ARM. 

DR InterPro; IPR002048; EF-hand. 

DR PROSITE; PS00018; EF_HAND; 1. 

KW Complete proteome. 

SQ SEQUENCE 1676 AA; 193674 MW; 28065878C0F6C961 CRC64; 



Query Match 78.0%; Score 32; DB 16; Length 1676; 

Best Local Similarity 71.4%; Pred. No. 9.4e+02; 

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 



Qy    1 LVFFAED 7
        ::|||||
Db 1657 IIFFAED 1663



RESULT 49
Q7Z443 

ID Q7Z443 PRELIMINARY; PRT; 1732 AA. 

AC Q7Z443; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Polycystic kidney disease 1-like 3. 

GN PKD1L3. 

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Testis; 

RX MEDLINE=22666619; PubMed=12782129;

RA Li A., Tian X., Sung S.-W., Somlo S.;

RT "Identification of two novel polycystic kidney disease-1-like genes in

RT human and mouse genomes.";

RL Genomics 81:596-608(2003).

DR EMBL; AY164485; AAO32798.1; -.

SQ SEQUENCE 1732 AA; 195892 MW; 64847A1C8BE5A6A1 CRC64; 

Query Match 78.0%; Score 32; DB 4; Length 1732; 

Best Local Similarity 75.0%; Pred. No. 9.7e+02; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy   1 LVFFAEDF 8
       | ||| ||
Db 668 LTFFASDF 675



RESULT 50 
O97917

ID O97917 PRELIMINARY; PRT; 49 AA.

AC O97917;

DT 01-MAY-1999 (TrEMBLrel. 10, Created) 

DT 01-MAY-1999 (TrEMBLrel. 10, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Amyloid protein (Fragment).

GN APP.

OS Bos taurus (Bovine).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;

OC Bovidae; Bovinae; Bos.

OX NCBI_TaxID=9913;

RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=20063685; PubMed=10594237;

RA Konfortov B.A., Licence V.E., Miller J.R.; 

RT "Resequencing of DNA from a diverse panel of cattle reveals a high 

RT level of polymorphism in both intron and exon."; 

RL Mamm. Genome 10:1142-1145(1999). 

DR EMBL; AJ133033; CAB38017.1; -. 

DR HSSP; P05067; 1BA4.

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR001255; Beta-APP.

DR Pfam; PF03494; Beta-APP; 1.

FT NON_TER 1 1

FT NON_TER 49 49

SQ SEQUENCE 49 AA; 5183 MW; 6287463F0559BDED CRC64; 



Query Match 75.6%; Score 31; DB 6; Length 49; 

Best Local Similarity 100.0%; Pred. No. 41; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 2 VFFAED 7
     ||||||
Db 1 VFFAED 6



RESULT 51 
Q974D3 

ID Q974D3 PRELIMINARY; PRT; 141 AA. 

AC Q974D3; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Hypothetical protein ST0719. 

GN ST0719. 

OS Sulfolobus tokodaii. 

OC Archaea; Crenarchaeota; Thermoprotei; Sulfolobales; Sulfolobaceae;

OC Sulfolobus. 

OX NCBI_TaxID=111955; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=JCM 10545 / 7;

RX MEDLINE=21456156; PubMed=11572479;

RA Kawarabayasi Y., Hino Y., Horikawa H., Jin-no K., Takahashi M.,

RA Sekine M., Baba S.-I., Ankai A., Kosugi H., Hosoyama A., Fukui S.,

RA Nagai Y., Nishijima K., Otsuka R., Nakazawa H., Takamiya M., Kato Y.,

RA Yoshizawa T., Tanaka T., Kudoh Y., Yamazaki J., Kushida N., Oguchi A.,

RA Aoki K.-I., Masuda S., Yanagii M., Nishimura M., Yamagishi A.,

RA Oshima T., Kikuchi H.;

RT "Complete genome sequence of an aerobic thermoacidophilic 

RT Crenarchaeon, Sulfolobus tokodaii strain7."; 

RL DNA Res. 8:123-140(2001). 

DR EMBL; AP000983; BAB65727.1; -. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 141 AA; 15582 MW; BD76CE0268B9920B CRC64; 



Query Match 75.6%; Score 31; DB 17; Length 141;
Best Local Similarity 62.5%; Pred. No. 1.2e+02;
Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy  1 LVFFAEDF 8
      |:|: |||
Db 22 LLFYVEDF 29



RESULT 52 
P91460 

ID P91460 PRELIMINARY; PRT; 150 AA.
AC P91460;
DT 01-MAY-1997 (TrEMBLrel. 03, Created)
DT 01-MAY-1997 (TrEMBLrel. 03, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE T20D4.15 protein.
GN T20D4.15.
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea;
OC Rhabditidae; Peloderinae; Caenorhabditis.
OX NCBI_TaxID=6239;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=Bristol N2;
RX MEDLINE=94150718; PubMed=7906398;
RA Wilson R., Ainscough R., Anderson K., Baynes C., Berks M.,
RA Bonfield J., Burton J., Connell M., Copsey T., Cooper J., Coulson A.,
RA Craxton M., Dear S., Du Z., Durbin R., Favello A., Fulton L.,
RA Gardner A., Green P., Hawkins T., Hillier L., Jier M., Johnston L.,
RA Jones M., Kershaw J., Kirsten J., Laister N., Latreille P.,
RA Lightning J., Lloyd C., McMurray A., Mortimore B., O'Callaghan M.,
RA Parsons J., Percy C., Rifken L., Roopra A., Saunders D., Shownkeen
RA Smaldon N., Smith A., Sonnhammer E., Staden R., Sulston J.,
RA Thierry-Mieg J., Thomas K., Vaudin M., Vaughan K., Waterston R.,
RA Watson A., Weinstock L., Wilkinson-Sproat J., Wohldman P.;
RT "2.2 Mb of contiguous nucleotide sequence from chromosome III of C.
RT elegans.";
RL Nature 368:32-38(1994).
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=Bristol N2;
RA Minx P., Graves T.;
RT "The sequence of C. elegans cosmid T20D4.";
RL Submitted (JAN-1997) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP SEQUENCE FROM N.A.
RC STRAIN=Bristol N2;
RA Waterston R.;
RL Submitted (NOV-1996) to the EMBL/GenBank/DDBJ databases.
DR EMBL; U80029; AAB37582.1; -.
DR PIR; T29939; T29939.
DR WormPep; T20D4.15; CE13806.
DR InterPro; IPR002542; DUF19.
DR Pfam; PF01579; DUF19; 1.
SQ SEQUENCE 150 AA; 17829 MW; 711200917D094EA4 CRC64;

Query Match 75.6%; Score 31; DB 5; Length 150;
Best Local Similarity 62.5%; Pred. No. 1.3e+02;
Matches 5; Conservative 3; Mismatches 0; Indels 0; Gaps 0;

Qy  1 LVFFAEDF 8
      ||:||::|
Db 60 LVYFADEF 67



RESULT 53 
Q8E591 

ID Q8E591 PRELIMINARY; PRT; 161 AA. 

AC Q8E591; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical protein. 

GN GBS1141. 

OS Streptococcus agalactiae (serotype III). 

OC Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;

OC Streptococcus. 

OX NCBI_TaxID=216495; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=NEM316 / Serotype III; 

RX MEDLINE=22242508; PubMed=12354221; 

RA Glaser P., Rusniok C., Buchrieser C., Chevalier F., Frangeul L.,

RA Msadek T., Zouine M., Couve E., Lalioui L., Poyart C., Trieu-Cuot P.,

RA Kunst F.;

RT "Genome sequence of Streptococcus agalactiae, a pathogen causing 

RT invasive neonatal disease."; 

RL Mol. Microbiol. 45:1499-1513(2002). 

DR EMBL; AL766849; CAD46800.1; -. 

DR SagaList; gbs1141; -.

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 161 AA; 18382 MW; EA300276002B122B CRC64; 

Query Match 75.6%; Score 31; DB 16; Length 161; 

Best Local Similarity 75.0%; Pred. No. 1.4e+02; 

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy   1 LVFFAEDF 8
       | |||:||
Db 118 LGFFAQDF 125



RESULT 54 
Q82JK4 



ID Q82JK4 PRELIMINARY; PRT; 179 AA.
AC Q82JK4;
DT 01-JUN-2003 (TrEMBLrel. 24, Created)
DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Hypothetical protein.
GN SAV2751.
OS Streptomyces avermitilis.
OC Bacteria; Actinobacteria; Actinobacteridae; Actinomycetales;
OC Streptomycineae; Streptomycetaceae; Streptomyces.
OX NCBI_TaxID=33903;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=MA-4680 / ATCC 31267 / NCIMB 12804 / NRRL 8165;
RX MEDLINE=21477403; PubMed=11572948;
RA Omura S., Ikeda H., Ishikawa J., Hanamoto A., Takahashi C.,
RA Shinose M., Takahashi Y., Horikawa H., Nakazawa H., Osonoe T.,
RA Kikuchi H., Shiba T., Sakaki Y., Hattori M.;
RT "Genome sequence of an industrial microorganism Streptomyces
RT avermitilis: deducing the ability of producing secondary
RT metabolites.";
RL Proc. Natl. Acad. Sci. U.S.A. 98:12215-12220(2001).
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=MA-4680 / ATCC 31267 / NCIMB 12804 / NRRL 8165;
RX MEDLINE=22608306; PubMed=12692562;
RA Ikeda H., Ishikawa J., Hanamoto A., Shinose M., Kikuchi H., Shiba T.,
RA Sakaki Y., Hattori M., Omura S.;
RT "Complete genome sequence and comparative analysis of the industrial
RT microorganism Streptomyces avermitilis.";
RL Nat. Biotechnol. 21:526-531(2003).
DR EMBL; AP005032; BAC70462.1; -.
DR InterPro; IPR005269; Cons_hypoth730.
DR Pfam; PF03641; Lysine_decarbox; 1.
DR TIGRFAMs; TIGR00730; TIGR00730; 1.
KW Hypothetical protein; Complete proteome.
SQ SEQUENCE 179 AA; 19286 MW; 75B53B6069659145 CRC64;

Query Match 75.6%; Score 31; DB 16; Length 179;
Best Local Similarity 85.7%; Pred. No. 1.6e+02;
Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAED 7
       ||||||:
Db 158 LVFFAEE 164



RESULT 55 
Q9Z588 

ID Q9Z588 PRELIMINARY; PRT; 182 AA.
AC Q9Z588;
DT 01-MAY-1999 (TrEMBLrel. 10, Created)
DT 01-MAY-1999 (TrEMBLrel. 10, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE SC8D9.03 protein.
GN SCO5491 OR SC8D9.03.
OS Streptomyces coelicolor.
OC Bacteria; Actinobacteria; Actinobacteridae; Actinomycetales;
OC Streptomycineae; Streptomycetaceae; Streptomyces.

OX NCBI_TaxID=1902; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=A3(2);

RA Murphy L., Harris D.;

RL Submitted (FEB-1999) to the EMBL/GenBank/DDBJ databases.

RN [2]

RP SEQUENCE FROM N.A.

RC STRAIN=A3(2);

RA Bentley S.D., Parkhill J., Barrell B.G., Rajandream M.A.;

RL Submitted (FEB-1999) to the EMBL/GenBank/DDBJ databases.

RN [3]

RP SEQUENCE FROM N.A.

RC STRAIN=A3(2);

RX MEDLINE=97000351; PubMed=8843436;

RA Redenbach M., Kieser H.M., Denapaite D., Eichner A., Cullum J.,

RA Kinashi H., Hopwood D.A.;

RT "A set of ordered cosmids and a detailed genetic and physical map for

RT the 8 Mb Streptomyces coelicolor A3(2) chromosome.";

RL Mol. Microbiol. 21:77-96(1996).

RN [4]

RP SEQUENCE FROM N.A.

RC STRAIN=A3(2) / M145;

RX MEDLINE=21996410; PubMed=12000953;

RA Bentley S.D., Chater K.F., Cerdeno-Tarraga A.-M., Challis G.L.,

RA Thomson N.R., James K.D., Harris D.E., Quail M.A., Kieser H.,

RA Harper D., Bateman A., Brown S., Chandra G., Chen C.W., Collins M.,

RA Cronin A., Fraser A., Goble A., Hidalgo J., Hornsby T., Howarth S.,

RA Huang C.-H., Kieser T., Larke L., Murphy L., Oliver K., O'Neil S.,

RA Rabbinowitsch E., Rajandream M.A., Rutherford K., Rutter S.,

RA Seeger K., Saunders D., Sharp S., Squares R., Squares S., Taylor K.,

RA Warren T., Wietzorrek A., Woodward J., Barrell B.G., Parkhill J.,

RA Hopwood D.A.;

RT "Complete genome sequence of the model actinomycete Streptomyces

RT coelicolor A3(2).";

RL Nature 417:141-147(2002).

DR EMBL; AL939123; CAB37567.1; -. 

DR PIR; T35807; T35807. 

DR InterPro; IPR005269; Cons_hypoth730.

DR Pfam; PF03641; Lysine_decarbox; 1. 

DR TIGRFAMs; TIGR00730; TIGR00730; 1. 

KW Complete proteome. 

SQ SEQUENCE 182 AA; 19721 MW; 8494A6107A939E49 CRC64; 

Query Match 75.6%; Score 31; DB 16; Length 182; 

Best Local Similarity 85.7%; Pred. No. 1.6e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 LVFFAED 7
       ||||||:
Db 161 LVFFAEE 167



RESULT 56 
Q9VQ40 



ID Q9VQ40 PRELIMINARY; PRT; 244 AA.

AC Q9VQ40;

DT 01-MAY-2000 (TrEMBLrel. 13, Created)

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update)

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)

DE CG17652 protein. 

GN CG17652. 

OS Drosophila melanogaster (Fruit fly).

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;

OC Ephydroidea; Drosophilidae; Drosophila.

OX NCBI_TaxID=7227;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; 

RX MEDLINE=20196006; PubMed=10731132;

RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,

RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,

RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,

RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X.,

RA Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D.,

RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G.,

RA Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D.,

RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M.,

RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S.,

RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P.,

RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I.,

RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P.,

RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M.,

RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P.,

RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W.,

RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K.,

RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,

RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck

RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C.,

RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A.,

RA Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z.,

RA Lasko P., Lei Y., Levitsky A.A., Li J., Li Z., Liang Y., Lin X.,

RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D.,

RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A.,

RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L.,

RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M.,

RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G.,

RA Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H.,

RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.,

RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E.,

RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X.,

RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J.,

RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A.,

RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L.,

RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O.,

RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;

RT "The genome sequence of Drosophila melanogaster."; 

RL Science 287:2185-2195(2000). 

DR EMBL; AE003585; AAF51342.1; -.

DR FlyBase; FBgn0031361; CG17652. 

DR InterPro; IPR006984; DUF652. 



DR Pfam; PF04900; DUF652; 1. 

SQ SEQUENCE 244 AA; 27504 MW; 08D54ACD7BCCB826 CRC64;

Query Match 75.6%; Score 31; DB 5; Length 244;
Best Local Similarity 75.0%; Pred. No. 2.1e+02;
Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy  1 LVFFAEDF 8
      ||||| :|
Db 13 LVFFASNF 20



RESULT 57 
Q8NN32 

ID Q8NN32 PRELIMINARY; PRT; 259 AA. 

AC Q8NN32; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created)

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Transcriptional regulator. 

GN CGL2381. 

OS Corynebacterium glutamicum (Brevibacterium flavum).

OC Bacteria; Actinobacteria; Actinobacteridae; Actinomycetales;

OC Corynebacterineae; Corynebacteriaceae; Corynebacterium.

OX NCBI_TaxID=1718; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ATCC 13032 / DSM 20300 / NCIB 10025;

RA Nakagawa S.;

RT "Complete genomic sequence of Corynebacterium glutamicum ATCC 13032.";

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AP005281; BAB99774.1; -.

DR GO; GO:0003700; F:transcription factor activity; IEA.

DR GO; GO:0006355; P:regulation of transcription, DNA-dependent; IEA.

DR InterPro; IPR001647; HTH_TetR. 

DR Pfam; PF00440; tetR; 1. 

KW Complete proteome.

SQ SEQUENCE 259 AA; 28363 MW; D487361648C34A2C CRC64; 

Query Match 75.6%; Score 31; DB 16; Length 259; 

Best Local Similarity 85.7%; Pred. No. 2.3e+02; 

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy  1 LVFFAED 7
      | |||||
Db 37 LTFFAED 43



RESULT 58 
Q9BGL3 

ID Q9BGL3 PRELIMINARY; PRT; 262 AA. 

AC Q9BGL3; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Membrane type-1 metalloproteinase (Fragment).

OS Ovis aries (Sheep).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;

OC Bovidae; Caprinae; Ovis.

OX NCBI_TaxID=9940;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RA Smith G.W., Ricke W.A., Cassar C.A., Smith M.F.; 

RL Submitted (MAY-2000) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF267160; AAG59848.1; -.

DR HSSP; P08254; 1HFS.

DR MEROPS; M10.014; -.

DR GO; GO:0005578; C:extracellular matrix; IEA.

DR GO; GO:0004222; F:metalloendopeptidase activity; IEA.

DR GO; GO:0008270; F:zinc ion binding; IEA.

DR GO; GO:0006508; P:proteolysis and peptidolysis; IEA.

DR InterPro; IPR000585; Hemopexin.

DR InterPro; IPR006026; Peptidase_M.

DR InterPro; IPR001818; Pept_M10A_M12B.

DR InterPro; IPR006025; Pept_M_Zn_BS.

DR Pfam; PF00045; hemopexin; 2.

DR Pfam; PF00413; Peptidase_M10; 1.

DR PRINTS; PR00138; MATRIXIN. 

DR SMART; SM00120; HX; 2. 

DR SMART; SM00235; ZnMc; 1. 

DR PROSITE; PS00024; HEMOPEXIN; 1. 

DR PROSITE; PS00142; ZINC_PROTEASE; 1. 

KW Hydrolase; Metal-binding; Metalloprotease; Protease; Zinc.

FT NON_TER 1 1 

FT NON_TER 262 262 

SQ SEQUENCE 262 AA; 29759 MW; 0C53ED138FE49271 CRC64; 



Query Match 75.6%; Score 31; DB 6; Length 262; 

Best Local Similarity 62.5%; Pred. No. 2.3e+02; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0; 

Qy  1 LVFFAEDF 8
      ::|||| |
Db 17 MIFFAEGF 24



RESULT 59 
Q8Y477 

ID Q8Y477 PRELIMINARY; PRT; 291 AA. 

AC Q8Y477; 

DT 01-MAR-2002 (TrEMBLrel. 20, Created) 

DT 01-MAR-2002 (TrEMBLrel. 20, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein lmo2578. 

GN LMO2578.

OS Listeria monocytogenes. 

OC Bacteria; Firmicutes; Bacillales; Listeriaceae; Listeria. 

OX NCBI_TaxID=1639; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=EGD-e / Serovar 1/2a;

RX MEDLINE=21537279; PubMed=11679669;

RA Glaser P., Frangeul L., Buchrieser C., Rusniok C., Amend A.,

RA Baquero F., Berche P., Bloecker H., Brandt P., Chakraborty T.,

RA Charbit A., Chetouani F., Couve E., de Daruvar A., Dehoux P.,

RA Domann E., Dominguez-Bernal G., Duchaud E., Durant L., Dussurget O.,

RA Entian K.-D., Fsihi H., Garcia-del Portillo F., Garrido P.,

RA Gautier L., Goebel W., Gomez-Lopez N., Hain T., Hauf J., Jackson D.,

RA Jones L.-M., Kaerst U., Kreft J., Kuhn M., Kunst F., Kurapkat G.,

RA Madueno E., Maitournam A., Mata Vicente J., Ng E., Nedjari H.,

RA Nordsiek G., Novella S., de Pablos B., Perez-Diaz J.-C., Purcell R.,

RA Remmel B., Rose M., Schlueter T., Simoes N., Tierrez A.,

RA Vazquez-Boland J.-A., Voss H., Wehland J., Cossart P.;

RT "Comparative genomics of Listeria species."; 

RL Science 294:849-852(2001). 

DR EMBL; AL591983; CAD00656.1; -. 

DR PIR; AB1397; AB1397. 

DR ListiList; LMO02578; -. 

DR GO; GO:0003824; F:catalytic activity; IEA.

DR InterPro; IPR000437; Prok_lipoprot_S.

DR InterPro; IPR000379; Ser_estrs.

DR PROSITE; PS00013; PROKAR_LIPOPROTEIN; 1.

KW Hypothetical protein; Complete proteome.

SQ SEQUENCE 291 AA; 32216 MW; 4605051B603FCE14 CRC64;



Query Match 75.6%; Score 31; DB 16; Length 291; 

Best Local Similarity 62.5%; Pred. No. 2.6e+02; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0; 



Qy   1 LVFFAEDF 8
       | :||||:
Db 145 LTYFAEDY 152



RESULT 60 
Q7VMQ9 

ID Q7VMQ9 PRELIMINARY; PRT; 308 AA. 

AC Q7VMQ9; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

GN HD0913. 

OS Haemophilus ducreyi. 

OC Bacteria; Proteobacteria; Gammaproteobacteria; Pasteurellales;

OC Pasteurellaceae; Haemophilus.

OX NCBI_TaxID=730;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=35000HP / ATCC 700724; 

RA Munson R.S. Jr., Ray W.C., Mahairas G., Sabo P., Mungur R.,

RA Johnson L., Nguyen D., Wang J., Forst C., Hood L.;

RT "The complete genome sequence of Haemophilus ducreyi.";

RL Submitted (JUN-2003) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AE017153; AAP95797.1; -. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 308 AA; 35872 MW; 9F0BFD09FEBBF027 CRC64;

Query Match 75.6%; Score 31; DB 16; Length 308;
Best Local Similarity 71.4%; Pred. No. 2.7e+02;
Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy   2 VFFAEDF 8
       :||| ||
Db 146 IFFARDF 152



RESULT 61 
O20015

ID O20015 PRELIMINARY; PRT; 361 AA.

AC O20015;

DT 01-JAN-1998 (TrEMBLrel. 05, Created) 

DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Intron maturase (Maturase K) (Fragment).

GN MATK.

OS Bifora americana.

OG Chloroplast.

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;

OC Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; asterids;

OC campanulids; Apiales; Apiaceae; Apioideae; apioid superclade;

OC apioid superclade incertae sedis; Bifora.

OX NCBI_TaxID=48030;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Plunkett G.M., Soltis D.E., Soltis P.S.; 

RT "Evolutionary patterns in Apiaceae: inferences based on matK sequence 

RT data.";

RL Syst. Bot. 21:477-495(1996).

CC -!- FUNCTION: PROBABLY ASSISTS IN SPLICING CHLOROPLAST GROUP II

CC INTRONS (BY SIMILARITY).

CC -!- SIMILARITY: WITH CORRESPONDING ORF IN OTHER PLANT CHLOROPLASTS,

CC AND REGIONS OF SIMILARITY TO MATURASE-LIKE POLYPEPTIDES ENCODED BY

CC MITOCHONDRIAL INTRONS.

DR EMBL; U58551; AAB66257.1; -.

DR GO; GO:0009507; C:chloroplast; IEA.

DR GO; GO:0006397; P:mRNA processing; IEA.

DR InterPro; IPR002866; MatK_N. 

DR Pfam; PF01824; MatK_N; 1. 

KW mRNA processing; Chloroplast. 

FT NON_TER 361 361 

SQ SEQUENCE 361 AA; 42877 MW; 017CE3453166C1C4 CRC64;

Query Match 75.6%; Score 31; DB 8; Length 361; 
Best Local Similarity 71.4%; Pred. No. 3.2e+02; 

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy  2 VFFAEDF 8
      :||| ||
Db 74 IFFANDF 80



RESULT 62 
O66650

ID O66650 PRELIMINARY; PRT; 368 AA.

AC O66650;

DT 01-AUG-1998 (TrEMBLrel. 07, Created)

DT 01-AUG-1998 (TrEMBLrel. 07, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE Hypothetical protein AQ_303.

GN AQ_303.

OS Aquifex aeolicus.

OC Bacteria; Aquificae; Aquificales; Aquificaceae; Aquifex.

OX NCBI_TaxID=63363; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=VF5; 

RX MEDLINE=98196666; PubMed=9537320;

RA Deckert G., Warren P.V., Gaasterland T., Young W.G., Lenox A.L.,

RA Graham D.E., Overbeek R., Snead M.A., Keller M., Aujay M., Huber R.,

RA Feldman R.A., Short J.M., Olson G.J., Swanson R.V.;

RT "The complete genome of the hyperthermophilic bacterium Aquifex

RT aeolicus.";

RL Nature 392:353-358(1998).

DR EMBL; AE000683; AAC06619.1; -.

DR PIR; F70327; F70327.

DR HSSP; O27564; 1LNQ.

DR GO; GO:0016020; C:membrane; IEA.

DR GO; GO:0008324; F:cation transporter activity; IEA.

DR GO; GO:0005267; F:potassium channel activity; IEA.

DR GO; GO:0006813; P:potassium ion transport; IEA.

DR InterPro; IPR001622; K+ channel_pore.

DR InterPro; IPR006037; TrkAC.

DR InterPro; IPR003148; TrkA_N. 

DR Pfam; PF02080; TrkA-C; 1. 

DR Pfam; PF02254; TrkA-N; 1. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 368 AA; 41864 MW; FC3E0600345EB236 CRC64;



Query Match 75.6%; Score 31; DB 16; Length 368; 

Best Local Similarity 71.4%; Pred. No. 3.2e+02; 

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy   2 VFFAEDF 8
       :|| |||
Db 107 IFFQEDF 113



RESULT 63 
Q7WD43 

ID Q7WD43 PRELIMINARY; PRT; 370 AA. 

AC Q7WD43; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Putative integral membrane protein. 

GN BB3735. 

OS Bordetella bronchiseptica (Alcaligenes bronchisepticus).

OC Bacteria; Proteobacteria; Betaproteobacteria; Burkholderiales;

OC Alcaligenaceae; Bordetella. 

OX NCBI_TaxID=518; 

RN [1] 

RP SEQUENCE FROM N.A. 



RC STRAIN=RB50 / ATCC BAA-588;

RX MEDLINE=22827954; PubMed=12910271;

RA Parkhill J., Sebaihia M., Preston A., Murphy L.D., Thomson N.,

RA Harris D.E., Holden M.T.G., Churcher C.M., Bentley S.D., Mungall K.L.,

RA Cerdeno-Tarraga A.M., Temple L., James K., Harris B., Quail M.A.,

RA Achtman M., Atkin R., Baker S., Basham D., Bason N., Cherevach I.,

RA Chillingworth T., Collins M., Cronin A., Davis P., Doggett J.,

RA Feltwell T., Goble A., Hamlin N., Hauser H., Holroyd S., Jagels K.,

RA Leather S., Moule S., Norberczak H., O'Neil S., Ormond D., Price C.,

RA Rabbinowitsch E., Rutter S., Sanders M., Saunders D., Seeger K.,

RA Sharp S., Simmonds M., Skelton J., Squares R., Squares S., Stevens K.,

RA Unwin L., Whitehead S., Barrell B.G., Maskell D.J.;

RT "Comparative analysis of the genome sequences of Bordetella pertussis,

RT Bordetella parapertussis and Bordetella bronchiseptica.";

RL Nat. Genet. 35:32-40(2003).

DR EMBL; BX640448; CAE35709.1; -.

KW Complete proteome.

SQ SEQUENCE 370 AA; 40050 MW; C9E4D8123E6F288B CRC64;

Query Match 75.6%; Score 31; DB 16; Length 370;

Best Local Similarity 100.0%; Pred. No. 3.3e+02;

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   2 VFFAED 7
       ||||||
Db 159 VFFAED 164



RESULT 64 
Q7W5K7 

ID Q7W5K7 PRELIMINARY; PRT; 370 AA. 

AC Q7W5K7; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Putative integral membrane protein. 

GN BPP3284. 

OS Bordetella parapertussis. 

OC Bacteria; Proteobacteria; Betaproteobacteria; Burkholderiales ; 

OC Alcaligenaceae; Bordetella. 

OX NCBI_TaxID=519; 

RN [1] 

RP SEQUENCE FROM N.A.

RC STRAIN=12822 / ATCC BAA-587;

RX MEDLINE=22827954; PubMed=12910271;

RA Parkhill J., Sebaihia M., Preston A., Murphy L.D., Thomson N.,

RA Harris D.E., Holden M.T.G., Churcher C.M., Bentley S.D., Mungall K.L.,

RA Cerdeno-Tarraga A.M., Temple L., James K., Harris B., Quail M.A.,

RA Achtman M., Atkin R., Baker S., Basham D., Bason N., Cherevach I.,

RA Chillingworth T., Collins M., Cronin A., Davis P., Doggett J.,

RA Feltwell T., Goble A., Hamlin N., Hauser H., Holroyd S., Jagels K.,

RA Leather S., Moule S., Norberczak H., O'Neil S., Ormond D., Price C.,

RA Rabbinowitsch E., Rutter S., Sanders M., Saunders D., Seeger K.,

RA Sharp S., Simmonds M., Skelton J., Squares R., Squares S., Stevens K.,

RA Unwin L., Whitehead S., Barrell B.G., Maskell D.J.;

RT "Comparative analysis of the genome sequences of Bordetella pertussis,

RT Bordetella parapertussis and Bordetella bronchiseptica.";



RL Nat. Genet. 35:32-40(2003). 

DR EMBL; BX640433; CAE38569.1; -. 

KW Complete proteome. 

SQ SEQUENCE 370 AA; 40110 MW; C9E4D8 073E60823B 1 CRC64 ; 

Query Match 75.6%; Score 31; DB 16; Length 370; 

Best Local Similarity 100.0%; Pred. No. 3.3e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   2 VFFAED 7
       ||||||
Db 159 VFFAED 164



RESULT 65 
Q7VW49 

ID Q7VW49 PRELIMINARY; PRT; 370 AA.

AC Q7VW49;

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Putative integral membrane protein. 

GN BP2420. 

OS Bordetella pertussis. 

OC Bacteria; Proteobacteria; Betaproteobacteria; Burkholderiales ; 

OC Alcaligenaceae; Bordetella. 

OX NCBI_TaxID=520; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Tohama I / ATCC BAA-589 / NCTC 13251; 

RX MEDLINE=22827954; PubMed=12910271;

RA Parkhill J., Sebaihia M., Preston A., Murphy L.D., Thomson N.,

RA Harris D.E., Holden M.T.G., Churcher C.M., Bentley S.D., Mungall K.L.,

RA Cerdeno-Tarraga A.M., Temple L., James K., Harris B., Quail M.A.,

RA Achtman M., Atkin R., Baker S., Basham D., Bason N., Cherevach I.,

RA Chillingworth T., Collins M., Cronin A., Davis P., Doggett J.,

RA Feltwell T., Goble A., Hamlin N., Hauser H., Holroyd S., Jagels K.,

RA Leather S., Moule S., Norberczak H., O'Neil S., Ormond D., Price C.,

RA Rabbinowitsch E., Rutter S., Sanders M., Saunders D., Seeger K.,

RA Sharp S., Simmonds M., Skelton J., Squares R., Squares S., Stevens K.,

RA Unwin L., Whitehead S., Barrell B.G., Maskell D.J.;

RT "Comparative analysis of the genome sequences of Bordetella pertussis,

RT Bordetella parapertussis and Bordetella bronchiseptica.";

RL Nat. Genet. 35:32-40(2003). 

DR EMBL; BX640418; CAE42691.1; -.

KW Complete proteome.

SQ SEQUENCE 370 AA; 40007 MW; BF3FB5A28484863F CRC64;

Query Match 75.6%; Score 31; DB 16; Length 370; 
Best Local Similarity 100.0%; Pred. No. 3.3e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          2 VFFAED 7
              ||||||
Db        159 VFFAED 164



RESULT 66 
Q8D9D2 

ID Q8D9D2 PRELIMINARY; PRT; 392 AA. 

AC Q8D9D2; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Glycosyltransferase.

GN W12671. 

OS Vibrio vulnificus. 

OC Bacteria; Proteobacteria; Gammaproteobacteria; Vibrionales; 

OC Vibrionaceae; Vibrio. 

OX NCBI_TaxID=672;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=CMCP6; 

RA Rhee J.H., Kim S.Y., Chung S.S., Kim J.J., Moon Y.H., Jeong H.,

RA Choy H.E.;

RT "Complete genome sequence of Vibrio vulnificus CMCP6.";

RL Submitted (DEC-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AE016806; AAO11018.1; -.

DR GO; GO:0004867; F:serine protease inhibitor activity; IEA.

DR GO; GO:0016740; F:transferase activity; IEA.

DR InterPro; IPR000215; Serpin. 

DR PROSITE; PS00284; SERPIN; 1. 

KW Transferase; Complete proteome. 

SQ SEQUENCE 392 AA; 44970 MW; 4152383F26B0A889 CRC64; 



Query Match 75.6%; Score 31; DB 16; Length 392;

Best Local Similarity 62.5%; Pred. No. 3.5e+02;

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              :: |||||
Db          4 MIVFAEDF 11



RESULT 67 
O06056

ID O06056 PRELIMINARY; PRT; 438 AA.

AC O06056;

DT 01-JUL-1997 (TrEMBLrel. 04, Created) 

DT 01-JUL-1997 (TrEMBLrel. 04, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein (Fragment).

OS Mycobacterium gordonae.

OC Bacteria; Actinobacteria; Actinobacteridae; Actinomycetales;

OC Corynebacterineae; Mycobacteriaceae; Mycobacterium. 

OX NCBI_TaxID=1778; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ATCC 14470;

RA Picardeau M., Vincent V.;

RT "Identification and characterization of IS-like elements in

RT Mycobacterium gordonae.";

RL Submitted (MAR-1997) to the EMBL/GenBank/DDBJ databases.

DR EMBL; U95315; AAB54013.1; -.

KW Hypothetical protein.

FT NON_TER 438 438

SQ SEQUENCE 438 AA; 47987 MW; 07E3E0125C4A62BB CRC64;

Query Match 75.6%; Score 31; DB 2; Length 438; 

Best Local Similarity 71.4%; Pred. No. 3.9e+02; 

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 

Qy          2 VFFAEDF 8
              :|||||:
Db        411 MFFAEDY 417



RESULT 68 
Q8U383 

ID Q8U383 PRELIMINARY; PRT; 455 AA.

AC Q8U383;

DT 01-JUN-2002 (TrEMBLrel. 21, Created)

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE Phospho-sugar mutase.

GN PF0588.

OS Pyrococcus furiosus.

OC Archaea; Euryarchaeota; Thermococci; Thermococcales; Thermococcaceae;

OC Pyrococcus.

OX NCBI_TaxID=2261;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=Vc1 / DSM 3638 / ATCC 43587 / JCM 8422;

RA Weiss R.B., Dunn D.M., Robb F.T., Brown J.R.;

RT "The complete sequence of the Pyrococcus furiosus genome.";

RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AE010181; AAL80712.1; -.

DR GO; GO:0016868; F:intramolecular transferase activity, phosph...; IEA.

DR GO; GO:0005975; P:carbohydrate metabolism; IEA.

DR InterPro; IPR005841; PG/PMM_mutase.

DR InterPro; IPR005844; PG_PMM_ABAI.

DR InterPro; IPR005845; PG_PMM_ABAII.

DR InterPro; IPR005846; PG_PMM_ABAIII.

DR InterPro; IPR005843; PG_PMM_C.

DR Pfam; PF00408; PGM_PMM; 1.

DR Pfam; PF02878; PGM_PMM_I; 1.

DR Pfam; PF02879; PGM_PMM_II; 1.

DR Pfam; PF02880; PGM_PMM_III; 1.

DR PRINTS; PR00509; PGMPMM.

KW Complete proteome.

SQ SEQUENCE 455 AA; 49632 MW; 03682D037276ABE4 CRC64;

Query Match 75.6%; Score 31; DB 17; Length 455;

Best Local Similarity 71.4%; Pred. No. 4e+02;

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              :|| |||
Db        131 IFFKEDF 137



RESULT 69 
Q8R7Q5 

ID Q8R7Q5 PRELIMINARY; PRT; 519 AA. 

AC Q8R7Q5; 

DT 01-JUN-2002 (TrEMBLrel. 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE AraC-type DNA-binding domain-containing proteins. 

GN ARAC OR TTE2343. 

OS Thermoanaerobacter tengcongensis. 

OC Bacteria; Firmicutes; Clostridia; Thermoanaerobacteriales;

OC Thermoanaerobacteriaceae; Thermoanaerobacter. 

OX NCBI_TaxID=119072; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=MB4 / JCM 11007; 

RX MEDLINE=21992816; PubMed=11997336;

RA Bao Q., Tian Y., Li W., Xu Z., Xuan Z., Hu S., Dong W., Yang J.,

RA Chen Y., Xue Y., Xu Y., Lai X., Huang L., Dong X., Ma Y., Ling L.,

RA Tan H., Chen R., Wang J., Yu J., Yang H.;

RT "A complete sequence of T. tengcongensis genome.";

RL Genome Res. 12:689-700(2002).

DR EMBL; AE013176; AAM25484.1; -.

DR GO; GO:0005622; C:intracellular; IEA.

DR GO; GO:0003700; F:transcription factor activity; IEA.

DR GO; GO:0000156; F:two-component response regulator activity; IEA.

DR GO; GO:0000160; P:two-component signal transduction system (p...; IEA.

DR InterPro; IPR000005; HTHAraC.

DR InterPro; IPR001789; Response_reg.

DR Pfam; PF00165; HTH_AraC; 2. 

DR Pfam; PF00072; response_reg; 1. 

DR PRINTS; PR00032; HTHARAC. 

DR ProDom; PD000039; Response_reg; 1. 

DR SMART; SM00342; HTH_ARAC; 1. 

DR SMART; SM00448; REC; 1. 

DR PROSITE; PS01124; HTH_ARAC_FAMILY_2; 1.

DR PROSITE; PS50110; RESPONSE_REGULATORY; 1. 

KW DNA-binding; Complete proteome. 

SQ SEQUENCE 519 AA; 60317 MW; 9E3477C8EC53F604 CRC64; 



Query Match 75.6%; Score 31; DB 16; Length 519;

Best Local Similarity 75.0%; Pred. No. 4.6e+02;

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              | ||||:|
Db        286 LKFFAEEF 293



RESULT 70
Q7X127 

ID Q7X127 PRELIMINARY; PRT; 524 AA. 

AC Q7X127; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 
DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 
DE TrwB. 



GN TRWB. 

OS Xanthomonas campestris (pv. citri).

OG Plasmid pXcB.

OC Bacteria; Proteobacteria; Gammaproteobacteria; Xanthomonadales;

OC Xanthomonadaceae; Xanthomonas.

OX NCBI_TaxID=346;

RN [1]

RP SEQUENCE FROM N.A.

RA Yuan Q., Brunings A.M., El-Yacoubi B., Shanker S., Gabriel D.W.;

RT "A self-mobilizing plasmid from a South American citrus canker strain

RT carries required pathogenicity gene pthB.";

RL Submitted (JAN-2003) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AY228335; AAO72101.1; -.

KW Plasmid. 

SQ SEQUENCE 524 AA; 58169 MW; AF362298C39955C2 CRC64; 



Query Match 75.6%; Score 31; DB 2; Length 524; 

Best Local Similarity 85.7%; Pred. No. 4.7e+02; 

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy          2 VFFAEDF 8
              | |||||
Db        496 VMFAEDF 502



RESULT 71
Q8PRK2

ID Q8PRK2 PRELIMINARY; PRT; 524 AA.

AC Q8PRK2;

DT 01-OCT-2002 (TrEMBLrel. 22, Created)

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE TrwB protein.

GN TRWB OR XACB0030.

OS Xanthomonas axonopodis (pv. citri).

OG Plasmid pXAC64.

OC Bacteria; Proteobacteria; Gammaproteobacteria; Xanthomonadales;

OC Xanthomonadaceae; Xanthomonas.

OX NCBI_TaxID=92829;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=306 / ATCC 13902 / XV 101;

RX MEDLINE=22022145; PubMed=12024217;

RA da Silva A.C.R., Ferro J.A., Reinach F.C., Farah C.S., Furlan L.R.,

RA Quaggio R.B., Monteiro-Vitorello C.B., Van Sluys M.A., Almeida N.F.,

RA Alves L.M.C., do Amaral A.M., Bertolini M.C., Camargo L.E.A.,

RA Camarotte G., Cannavan F., Cardozo J., Chambergo F., Ciapina L.P.,

RA Cicarelli R.M.B., Coutinho L.L., Cursino-Santos J.R., El-Dorry H.,

RA Faria J.B., Ferreira A.J.S., Ferreira R.C.C., Ferro M.I.T.,

RA Formighieri E.F., Franco M.C., Greggio C.C., Gruber A.,

RA Katsuyama A.M., Kishi L.T., Leite R.P., Lemos E.G.M., Lemos M.V.F.,

RA Locali E.C., Machado M.A., Madeira A.M.B.N., Martinez-Rossi N.M.,

RA Martins E.C., Meidanis J., Menck C.F.M., Miyaki C.Y., Moon D.H.,

RA Moreira L.M., Novo M.T.M., Okura V.K., Oliveira M.C., Oliveira V.R.,

RA Pereira H.A., Rossi A., Sena J.A.D., Silva C., de Souza R.F.,

RA Spinola L.A.F., Takita M.A., Tamura R.E., Teixeira E.C., Tezza R.I.D.,

RA Trindade dos Santos M., Truffi D., Tsai S.M., White F.F.,

RA Setubal J.C., Kitajima J.P.;

RT "Comparison of the genomes of two Xanthomonas pathogens with differing 

RT host specificities."; 

RL Nature 417:459-463(2002).

DR EMBL; AE008925; AAM39276.1; -.

DR GO; GO:0046821; C:extrachromosomal DNA; IEA.

DR GO; GO:0016020; C:membrane; IEA.

DR GO; GO:0009291; P:unidirectional conjugation; IEA.

DR InterPro; IPR003688; TRAG.

DR Pfam; PF02534; TRAG; 1.

KW Plasmid; Complete proteome.

SQ SEQUENCE 524 AA; 58075 MW; 17D306794D35A3D5 CRC64;

Query Match 75.6%; Score 31; DB 16; Length 524; 

Best Local Similarity 85.7%; Pred. No. 4.7e+02; 

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy          2 VFFAEDF 8
              | |||||
Db        496 VMFAEDF 502



RESULT 72 
Q96AA0 

ID Q96AA0 PRELIMINARY; PRT; 537 AA. 

AC Q96AA0; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein FLJ14869. 

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Placenta; 

RA Isogai T., Ota T., Hayashi K., Sugiyama T., Otsuki T., Suzuki Y.,

RA Nishikawa T., Nagai K., Sugano S., Takahashi-Fujii A., Hara H.,

RA Tanase T., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K.,

RA Arita M., Nabekura T., Ishii S., Kawai Y., Saito K., Yamamoto J.,

RA Wakamatsu A., Nakamura Y., Nagahari K., Masuho Y., Oshima A.;

RT "NEDO human cDNA sequencing project.";

RL Submitted (MAY-2001) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AK027775; BAB55358.1; -.

DR GO; GO:0005509; F:calcium ion binding; IEA.

DR InterPro; IPR000152; Asx_hydroxyl_S.

DR InterPro; IPR001881; EGF_Ca. 

DR InterPro; IPR006209; EGF_like. 

DR InterPro; IPR006210; IEGF. 

DR InterPro; IPR002035; VWF_A. 

DR Pfam; PF00008; EGF; 6. 

DR Pfam; PF00092; vwa; 1. 

DR PRINTS; PR00453; VWFADOMAIN. 

DR SMART; SM00181; EGF; 5. 

DR SMART; SM00327; VWA; 1. 

DR PROSITE; PS00010; ASX_HYDROXYL; 4.

DR PROSITE; PS01186; EGF_2; 5.

DR PROSITE; PS50234; VWFA; 1.

KW Hypothetical protein; EGF-like domain.

SQ SEQUENCE 537 AA; 60234 MW; 5577F5C55B389448 CRC64;

Query Match 75.6%; Score 31; DB 4; Length 537; 

Best Local Similarity 71.4%; Pred. No. 4.8e+02; 

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 

Qy          2 VFFAEDF 8
              :|:||||
Db        395 LFYAEDF 401



RESULT 73 
Q9XSP0 

ID Q9XSP0 PRELIMINARY; PRT; 582 AA. 

AC Q9XSP0; 

DT 01-NOV-1999 (TrEMBLrel. 12, Created) 

DT 01-NOV-1999 (TrEMBLrel. 12, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Membrane-type matrix metalloproteinase-1.

GN MT1MMP.

OS Capra hircus (Goat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Caprinae; Capra. 

OX NCBI_TaxID=9925; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Placenta; 

RA Uekiata T., Tanaka S., Mariko Y., Sato H., Seiki M., Tojo H.,

RA Tachi C.;

RT "cDNA sequence of membrane-type matrix metalloproteinase of Shiba goat

RT (Capra hircus var Shiba).";

RL Submitted (FEB-1998) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AB010921; BAA36551.1; -.

DR HSSP; P39900; 1JIZ.

DR MEROPS; M10.014; -.

DR GO; GO:0005578; C:extracellular matrix; IEA.

DR GO; GO:0016020; C:membrane; IEA.

DR GO; GO:0004222; F:metalloendopeptidase activity; IEA.

DR GO; GO:0008270; F:zinc ion binding; IEA.

DR GO; GO:0006508; P:proteolysis and peptidolysis; IEA.

DR InterPro; IPR000585; Hemopexin.

DR InterPro; IPR006026; Peptidase_M.

DR InterPro; IPR001818; Pept_M10A_M12B.

DR InterPro; IPR006025; Pept_M_Zn_BS.

DR Pfam; PF00045; hemopexin; 4.

DR Pfam; PF00413; Peptidase_M10; 1.

DR Pfam; PF03933; Peptidase_M10_N; 1.

DR PRINTS; PR00138; MATRIXIN.

DR SMART; SM00120; HX; 4.

DR SMART; SM00235; ZnMc; 1.

DR PROSITE; PS00546; CYSTEINE_SWITCH; 1.

DR PROSITE; PS00024; HEMOPEXIN; 1.

DR PROSITE; PS00142; ZINC_PROTEASE; 1.

KW Hydrolase; Membrane; Metal-binding; Metalloprotease; Protease; Zinc.

SQ SEQUENCE 582 AA; 65878 MW; B2B4E1CEFDB7A4BE CRC64;



Query Match 75.6%; Score 31; DB 6; Length 582; 

Best Local Similarity 62.5%; Pred. No. 5.2e+02; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0; 

Qy          1 LVFFAEDF 8
              ::|||| |
Db        177 MIFFAEGF 184



RESULT 74 
Q9GLE4 

ID Q9GLE4 PRELIMINARY; PRT; 582 AA. 

AC Q9GLE4; 

DT 01-MAR-2001 (TrEMBLrel. 16, Created)

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Membrane-type matrix metalloprotease 1. 

OS Bos taurus (Bovine).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang B., Yan L., Moses M.A., Fang J., Miao H., Tsang P.;

RT "Molecular cloning and biological characterization of bovine membrane-

RT type matrix metalloprotease 1 (bMT1-MMP).";

RL Submitted (JUL-2000) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF290429; AAG28170.1; -.

DR HSSP; P39900; 1JIZ.

DR MEROPS; M10.014; -.

DR GO; GO:0005578; C:extracellular matrix; IEA.

DR GO; GO:0004222; F:metalloendopeptidase activity; IEA.

DR GO; GO:0008270; F:zinc ion binding; IEA.

DR GO; GO:0006508; P:proteolysis and peptidolysis; IEA.

DR InterPro; IPR000585; Hemopexin.

DR InterPro; IPR006026; Peptidase_M.

DR InterPro; IPR001818; Pept_M10A_M12B.

DR InterPro; IPR006025; Pept_M_Zn_BS.

DR Pfam; PF00045; hemopexin; 4.

DR Pfam; PF00413; Peptidase_M10; 1.

DR Pfam; PF03933; Peptidase_M10_N; 1. 

DR PRINTS; PR00138; MATRIXIN. 

DR SMART; SM00120; HX; 4. 

DR SMART; SM00235; ZnMc; 1. 

DR PROSITE; PS00546; CYSTEINE_SWITCH; 1. 

DR PROSITE; PS00024; HEMOPEXIN; 1. 

DR PROSITE; PS00142; ZINC_PROTEASE; 1. 

KW Hydrolase; Metal-binding; Metalloprotease; Protease; Zinc. 

SQ SEQUENCE 582 AA; 65883 MW; 65174CE65D4040E1 CRC64; 



Query Match 75.6%; Score 31; DB 6; Length 582; 

Best Local Similarity 62.5%; Pred. No. 5.2e+02; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              ::|||| |
Db        177 MIFFAEGF 184



RESULT 75 
Q86C23 

ID Q86C23 PRELIMINARY; PRT; 603 AA. 

AC Q86C23;

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Mre11.

OS Entamoeba histolytica.

OC Eukaryota; Entamoebidae; Entamoeba.

OX NCBI_TaxID=5759;

RN [1]

RP SEQUENCE FROM N.A.

RA Frisardi M., Nixon J., Loftus B., Samuelson J.;

RT "How asexual are Giardia lamblia and Entamoeba histolytica?";

RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF485822; AAP35101.1; -.

DR GO; GO:0016787; F:hydrolase activity; IEA.

DR InterPro; IPR004843; M-ppestrase. 

DR Pfam; PF00149; Metallophos; 1. 

SQ SEQUENCE 603 AA; 69760 MW; CD582A1ADC96A0E1 CRC64; 



Query Match 75.6%; Score 31; DB 5; Length 603;

Best Local Similarity 85.7%; Pred. No. 5.4e+02;

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              ||| |||
Db        402 VFFDEDF 408



RESULT 76
Q98SG0

ID Q98SG0 PRELIMINARY; PRT; 693 AA.

AC Q98SG0;

DT 01-JUN-2001 (TrEMBLrel. 17, Created)

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE Beta-amyloid precursor protein A.

GN APP.

OS Xenopus laevis (African clawed frog).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae;

OC Xenopodinae; Xenopus.

OX NCBI_TaxID=8355;

RN [1]

RP SEQUENCE FROM N.A.

RA Van den Hurk W.H.;

RL Thesis (2001), Department of Biological Sciences,

RL University of Nijmegen, Nijmegen, Netherlands.

DR EMBL; AJ298150; CAC37193.1; -.

DR HSSP; P05067; 1HZ3.

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR008154; A4_extra. 

DR InterPro; IPR001255; Beta-APP. 

DR Pfam; PF02177; A4_EXTRA; 1. 

DR Pfam; PF03494; Beta-APP; 1. 

DR PRINTS; PR00203; AMYLOIDA4. 

DR SMART; SM00006; A4_EXTRA; 1. 

DR PROSITE; PS00319; A4_EXTRA; 1. 

DR PROSITE; PS00320; A4_INTRA; 1. 

KW Signal. 

FT SIGNAL 1 18 POTENTIAL. 

SQ SEQUENCE 693 AA; 78568 MW; CAF1DF655C1AB653 CRC64; 



Query Match 75.6%; Score 31; DB 13; Length 693; 

Best Local Similarity 85.7%; Pred. No. 6.2e+02;

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy          1 LVFFAED 7
              ||||||:
Db        611 LVFFAEE 617



RESULT 77 
Q7Z5X1 

ID Q7Z5X1 PRELIMINARY; PRT; 715 AA. 

AC Q7Z5X1; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein (Fragment).

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S.,

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E.,

RA Jones S.J., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human

RT and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

RN [2]

RP SEQUENCE FROM N.A.

RC TISSUE=Ovary;

RA Strausberg R.;

RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BC016394; AAH16394.1; -.

KW Hypothetical protein.

FT NON_TER 1 1

SQ SEQUENCE 715 AA; 80057 MW; CBC5CEBE2D4E1F65 CRC64;



Query Match 75.6%; Score 31; DB 4; Length 715;

Best Local Similarity 71.4%; Pred. No. 6.4e+02;

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              :|:||||
Db        573 LFYAEDF 579



RESULT 78
Q91963 

ID Q91963 PRELIMINARY; PRT; 747 AA. 

AC Q91963; 

DT 01-NOV-1996 (TrEMBLrel. 01, Created) 

DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE APP747. 

GN APP747. 

OS Xenopus.

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae; 

OC Xenopodinae. 

OX NCBI_TaxID=8353; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=93129227; PubMed=1282805;

RA Okado H., Okamoto H.;

RT "A Xenopus homologue of the human beta-amyloid precursor protein:

RT developmental regulation of its gene expression.";

RL Biochem. Biophys. Res. Commun. 189:1561-1568(1992).

DR EMBL; S52417; AAB24853.1; -.

DR HSSP; P05067; 1HZ3.

DR GO; GO:0016020; C:membrane; IEA.

DR GO; GO:0004867; F:serine protease inhibitor activity; IEA.

DR InterPro; IPR008155; A4_APP.

DR InterPro; IPR008154; A4_extra.

DR InterPro; IPR001255; Beta-APP.

DR InterPro; IPR002223; Kunitz_BPTI.

DR Pfam; PF02177; A4_EXTRA; 1.

DR Pfam; PF03494; Beta-APP; 1.

DR Pfam; PF00014; Kunitz_BPTI; 1.

DR PRINTS; PR00203; AMYLOIDA4.

DR PRINTS; PR00759; BASICPTASE.

DR ProDom; PD000222; Kunitz_BPTI; 1.

DR SMART; SM00131; KU; 1.

DR PROSITE; PS00319; A4_EXTRA; 1.

DR PROSITE; PS00320; A4_INTRA; 1.

DR PROSITE; PS00280; BPTI_KUNITZ_1; 1.

DR PROSITE; PS50279; BPTI_KUNITZ_2; 1.

KW Protease inhibitor; Serine protease inhibitor. 

SQ SEQUENCE 747 AA; 84893 MW; A75E81885681D948 CRC64; 

Query Match 75.6%; Score 31; DB 13; Length 747; 

Best Local Similarity 85.7%; Pred. No. 6.7e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 LVFFAED 7
              ||||||:
Db        665 LVFFAEE 671



RESULT 79
O77066

ID O77066 PRELIMINARY; PRT; 748 AA.

AC O77066;

DT 01-NOV-1998 (TrEMBLrel. 08, Created) 

DT 01-NOV-1998 (TrEMBLrel. 08, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Diapause associated protein 2. 

GN DAP2.

OS Choristoneura fumiferana (Spruce budworm).

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; 

OC Tortricoidea; Tortricidae; Tortricinae; Choristoneura. 

OX NCBI_TaxID=7141; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Palli S.R., Ladd T.R., Ricci A.R., Primavera M., Mungrue I.N.,

RA Pang A.S.D., Retnakaran A.;

RT "Synthesis of the same two proteins prior to larval diapause and 

RT pupation in the spruce budworm, Choristoneura fumiferana."; 

RL J. Insect Physiol. 44:509-524(1998). 

DR EMBL; AF007768; AAC35429.1; -. 

DR HSSP; P04253; 1LLA. 

DR GO; GO:0005344; F:oxygen transporter activity; IEA.

DR GO; GO:0006810; P:transport; IEA.

DR InterPro; IPR008922; Di-copper_centre.

DR InterPro; IPR000896; Hemocyanin.

DR InterPro; IPR005203; hemocyanin_C.

DR InterPro; IPR005204; hemocyanin_N.

DR InterPro; IPR007110; Ig-like.

DR Pfam; PF00372; hemocyanin; 1.

DR Pfam; PF03723; hemocyanin_C; 1.

DR Pfam; PF03722; hemocyanin_N; 1.

DR PRINTS; PR00187; HAEMOCYANIN.

DR PROSITE; PS00209; HEMOCYANIN_1; 1.

DR PROSITE; PS00210; HEMOCYANIN_2; 1.

SQ SEQUENCE 748 AA; 88192 MW; 7354407BCF285CF5 CRC64;



Query Match 75.6%; Score 31; DB 5; Length 748;

Best Local Similarity 62.5%; Pred. No. 6.7e+02;

Matches 5; Conservative 3; Mismatches 0; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              |::||:||
Db        109 LLYFAKDF 116



RESULT 80
Q96FT5

ID Q96FT5 PRELIMINARY; PRT; 937 AA.

AC Q96FT5;

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Matrilin 2. 

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain;

RA Strausberg R.;

RL Submitted (JUL-2001) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BC010444; AAH10444.1; -.

DR GO; GO:0005509; F:calcium ion binding; IEA.

DR InterPro; IPR000152; Asx_hydroxyl_S.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR InterPro; IPR002035; VWF_A.

DR Pfam; PF00008; EGF; 10.

DR Pfam; PF00092; vwa; 2.

DR PRINTS; PR00453; VWFADOMAIN.

DR SMART; SM00179; EGF_CA; 1.

DR SMART; SM00327; VWA; 2.

DR PROSITE; PS00010; ASX_HYDROXYL; 9.

DR PROSITE; PS01186; EGF_2; 9.

DR PROSITE; PS50234; VWFA; 2.

KW EGF-like domain.

SQ SEQUENCE 937 AA; 104775 MW; 7C76C5CE70F06E40 CRC64;

Query Match 75.6%; Score 31; DB 4; Length 937; 

Best Local Similarity 71.4%; Pred. No. 8.4e+02; 

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              :|:||||
Db        814 LFYAEDF 820



RESULT 81 
Q99K64 

ID Q99K64 PRELIMINARY; PRT; 956 AA.

AC Q99K64;

DT 01-JUN-2001 (TrEMBLrel. 17, Created)

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)



DE Matrilin 2. 

GN MATN2.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RA Strausberg R.;

RL Submitted (MAR-2001) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BC005429; AAH05429.1; -.

DR HSSP; P05099; 1AQ5.

DR MGD; MGI:109613; Matn2.

DR GO; GO:0005578; C:extracellular matrix; IDA.

DR InterPro; IPR000152; Asx_hydroxyl_S.

DR InterPro; IPR001881; EGF_Ca. 

DR InterPro; IPR006209; EGF_like. 

DR InterPro; IPR006210; IEGF. 

DR InterPro; IPR002035; VWF_A. 

DR Pfam; PF00008; EGF; 10. 

DR Pfam; PF00092; vwa; 2. 

DR PRINTS; PR00453; VWFADOMAIN. 

DR SMART; SM00181; EGF; 10. 

DR SMART; SM00327; VWA; 2. 

DR PROSITE; PS00010; ASX_HYDROXYL; 7.

DR PROSITE; PS01186; EGF_2; 9.

DR PROSITE; PS50234; VWFA; 2.

KW EGF-like domain.

SQ SEQUENCE 956 AA; 106706 MW; 373BFC774370BFEE CRC64;



Query Match 75.6%; Score 31; DB 11; Length 956;

Best Local Similarity 71.4%; Pred. No. 8.6e+02;

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              :|:||||
Db        814 LFYAEDF 820



RESULT 82 
Q8R542 
ID Q8R542 PRELIMINARY; PRT; 956 AA.

AC Q8R542;

DT 01-JUN-2002 (TrEMBLrel. 21, Created)

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE Matrilin-2.

GN MATN2.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=129/Sv;

RX MEDLINE=21841526; PubMed=11852232;

RA Mates L., Korpos E., Deak F., Liu Z., Beier D.R., Aszodi A., Kiss I.;

RT "Comparative analysis of the mouse and human genes (Matn2 and MATN2)

RT for matrilin-2, a filament-forming protein widely distributed in

RT extracellular matrices.";

RL Matrix Biol. 21:163-174(2002).

DR EMBL; AF358844; AAM11539.1; -.

DR EMBL; AF358831; AAM11539.1; JOINED.

DR EMBL; AF358832; AAM11539.1; JOINED.

DR EMBL; AF358833; AAM11539.1; JOINED.

DR EMBL; AF358834; AAM11539.1; JOINED.

DR EMBL; AF358835; AAM11539.1; JOINED.

DR EMBL; AF358836; AAM11539.1; JOINED.

DR EMBL; AF358837; AAM11539.1; JOINED.

DR EMBL; AF358838; AAM11539.1; JOINED.

DR EMBL; AF358839; AAM11539.1; JOINED.

DR EMBL; AF358840; AAM11539.1; JOINED.

DR EMBL; AF358841; AAM11539.1; JOINED.

DR EMBL; AF358842; AAM11539.1; JOINED.

DR EMBL; AF358843; AAM11539.1; JOINED.

DR GO; GO:0005509; F:calcium ion binding; IEA.

DR InterPro; IPR000152; Asx_hydroxyl_S.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR InterPro; IPR006210; IEGF.

DR InterPro; IPR002035; VWF_A.

DR Pfam; PF00008; EGF; 10.

DR Pfam; PF00092; vwa; 2.

DR PRINTS; PR00453; VWFADOMAIN.

DR SMART; SM00181; EGF; 10.

DR SMART; SM00327; VWA; 2.

DR PROSITE; PS00010; ASX_HYDROXYL; 7.

DR PROSITE; PS01186; EGF_2; 9.

DR PROSITE; PS50234; VWFA; 2.

KW EGF-like domain.

SQ SEQUENCE 956 AA; 106748 MW; 3E4A608FF92BDE55 CRC64;



Query Match 75.6%; Score 31; DB 11; Length 956;

Best Local Similarity 71.4%; Pred. No. 8.6e+02;

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              :|:||||
Db        814 LFYAEDF 820



RESULT 83
Q8NDE6

ID Q8NDE6 PRELIMINARY; PRT; 1016 AA.

AC Q8NDE6;

DT 01-OCT-2002 (TrEMBLrel. 22, Created)

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE Hypothetical protein (Fragment).

GN DKFZP434I245.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1]

RP SEQUENCE FROM N.A.

RC TISSUE=Testis;

RA Duesterhoeft A., Lauber J., Mewes H.W., Weil B., Wiemann S.;

RL Submitted (JUL-2002) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AL833931; CAD38787.1; -.

DR GO; GO:0005509; F:calcium ion binding; IEA.

DR InterPro; IPR000152; Asx_hydroxyl_S.

DR InterPro; IPR001881; EGF_Ca.

DR InterPro; IPR006209; EGF_like.

DR InterPro; IPR006210; IEGF.

DR InterPro; IPR002035; VWF_A.

DR Pfam; PF00008; EGF; 10.

DR Pfam; PF00092; vwa; 2.

DR PRINTS; PR00453; VWFADOMAIN.

DR SMART; SM00181; EGF; 10.

DR SMART; SM00179; EGF_CA; 10.

DR SMART; SM00327; VWA; 2.

DR PROSITE; PS00010; ASX_HYDROXYL; 9.

DR PROSITE; PS01186; EGF_2; 9.

DR PROSITE; PS50234; VWFA; 2.

KW Hypothetical protein.

FT NON_TER 1 1

SQ SEQUENCE 1016 AA; 113088 MW; 4BDC0DE391DA7C82 CRC64;



Query Match 75.6%; Score 31; DB 4; Length 1016;

Best Local Similarity 71.4%; Pred. No. 9.2e+02;

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              :|:||||
Db        874 LFYAEDF 880



RESULT 84 
Q8WRY3

ID Q8WRY3 PRELIMINARY; PRT; 1296 AA. 

AC Q8WRY3; 

DT 01-MAR-2002 (TrEMBLrel. 20, Created) 

DT 01-MAR-2002 (TrEMBLrel. 20, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Cytoadherence linked asexual protein-like protein. 

OS Plasmodium chabaudi.

OC Eukaryota; Alveolata; Apicomplexa; Haemosporida; Plasmodium. 

OX NCBI_TaxID=5825; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=AS;

RA Holt D.C., Fischer K., Tchavtchitch M., Wilson D.W., Hauquitz N.E.,

RA Hawthorne P.L., Gardiner D.L., Trenholme K.R., Kemp D.J.;

RT "Clags in P. falciparum and other species of Plasmodium.";

RL Mol. Biochem. Parasitol. 118:259-263(2002).

DR EMBL; AF387740; AAL57302.1; -.

DR InterPro; IPR005553; CLAG.

DR InterPro; IPR001202; WW_Rsp5_WWP.

DR Pfam; PF03805; CLAG; 1.

DR PROSITE; PS01159; WW_DOMAIN_1; 1.



SQ SEQUENCE 1296 AA; 152409 MW; 35B134DF9A54EDAD CRC64; 



Query Match 75.6%; Score 31; DB 5; Length 1296; 

Best Local Similarity 62.5%; Pred. No. 1.2e+03; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0; 

Qy          1 LVFFAEDF 8
              ::|||| |
Db        790 IIFFAEPF 797



RESULT 85
Q8NIT0
ID Q8NIT0 PRELIMINARY; PRT; 1364 AA.
AC Q8NIT0;
DT 01-OCT-2002 (TrEMBLrel. 22, Created)
DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Probable xanthine dehydrogenase.
GN B23E9.040.
OS Neurospora crassa.
OC Eukaryota; Fungi; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Sordariaceae; Neurospora.
OX NCBI_TaxID=5141;
RN [1]
RP SEQUENCE FROM N.A.
RA Schulte U., Aign V., Hoheisel J., Brandt P., Fartmann B., Holland R.,
RA Nyakatura G., Mewes H.W., Mannhaupt G.;
RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RA German Neurospora genome project;
RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AL807372; CAD37030.1; -.
DR PIR; T51920; T51920.
DR GO; GO:0005489; F:electron transporter activity; IEA.
DR GO; GO:0016491; F:oxidoreductase activity; IEA.
DR GO; GO:0006118; P:electron transport; IEA.
DR InterPro; IPR002888; 2Fe-2S_bind.
DR InterPro; IPR006058; 2Fe2S_fd_BS.
DR InterPro; IPR008274; Aldxan_dh_bind.
DR InterPro; IPR000674; Aldxan_dh_hamm.
DR InterPro; IPR005107; CO_deh_flav_C.
DR InterPro; IPR002346; dehydrog_molyb.
DR InterPro; IPR001041; Ferredoxin.
DR Pfam; PF01315; Ald_Xan_dh_C; 1.
DR Pfam; PF02738; Ald_Xan_dh_C2; 1.
DR Pfam; PF03450; CO_deh_flav_C; 1.
DR Pfam; PF00941; FAD_binding_5; 1.
DR Pfam; PF00111; fer2; 1.
DR Pfam; PF01799; fer2_2; 1.
DR ProDom; PD186071; 2Fe-2S_bind; 1.
DR PROSITE; PS00197; 2FE2S_FERREDOXIN; 1.
KW Iron; Iron-sulfur.
SQ SEQUENCE 1364 AA; 149832 MW; 282B3187A22D2127 CRC64;

Query Match 75.6%; Score 31; DB 3; Length 1364;
Best Local Similarity 100.0%; Pred. No. 1.2e+03;
Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          2 VFFAED 7
              ||||||
Db        681 VFFAED 686



RESULT 86 

Q8GJB6 

ID Q8GJB6 PRELIMINARY; PRT; 136 AA.



AC Q8GJB6; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Hypothetical protein. 

GN CJP39. 

OS Campylobacter jejuni. 

OG Plasmid pVir. 

OC Bacteria; Proteobacteria; Epsilonproteobacteria; Campylobacterales;
OC Campylobacteraceae; Campylobacter.
OX NCBI_TaxID=197;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=81-176;
RX MEDLINE=22267121; PubMed=12379703;
RA Bacon D.J., Alm R.A., Hu L., Hickey T.E., Ewing C.P., Batchelor R.A.,

RA Trust T.J., Guerry P.; 

RT "DNA Sequence and Mutational Analyses of the pVir Plasmid of 

RT Campylobacter jejuni 81-176."; 

RL Infect. Immun. 70:6242-6250(2002). 

DR EMBL; AF226280; AAN46932.1; -. 

DR GO; GO:0046821; C:extrachromosomal DNA; IEA.
KW Hypothetical protein; Plasmid.
SQ SEQUENCE 136 AA; 16129 MW; B2499FF56A7F8F45 CRC64;

Query Match 73.2%; Score 30; DB 2; Length 136;
Best Local Similarity 50.0%; Pred. No. 1.9e+02;
Matches 4; Conservative 3; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              ::|| :||
Db         20 IIFFGKDF 27



RESULT 87
Q8PEK4
ID Q8PEK4 PRELIMINARY; PRT; 137 AA.
AC Q8PEK4;
DT 01-OCT-2002 (TrEMBLrel. 22, Created)
DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update)
DE Enterocin biosynthesis related protein.
GN XAC4345.
OS Xanthomonas axonopodis (pv. citri).
OC Bacteria; Proteobacteria; Gammaproteobacteria; Xanthomonadales;
OC Xanthomonadaceae; Xanthomonas.



OX NCBI_TaxID=92829; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=306 / ATCC 13902 / XV 101; 

RX MEDLINE=22022145; PubMed=12024217;
RA da Silva A.C.R., Ferro J.A., Reinach F.C., Farah C.S., Furlan L.R.,
RA Quaggio R.B., Monteiro-Vitorello C.B., Van Sluys M.A., Almeida N.F.,
RA Alves L.M.C., do Amaral A.M., Bertolini M.C., Camargo L.E.A.,
RA Camarotte G., Cannavan F., Cardozo J., Chambergo F., Ciapina L.P.,
RA Cicarelli R.M.B., Coutinho L.L., Cursino-Santos J.R., El-Dorry H.,
RA Faria J.B., Ferreira A.J.S., Ferreira R.C.C., Ferro M.I.T.,
RA Formighieri E.F., Franco M.C., Greggio C.C., Gruber A.,
RA Katsuyama A.M., Kishi L.T., Leite R.P., Lemos E.G.M., Lemos M.V.F.,
RA Locali E.G., Machado M.A., Madeira A.M.B.N., Martinez-Rossi N.M.,
RA Martins E.C., Meidanis J., Menck C.F.M., Miyaki C.Y., Moon D.H.,
RA Moreira L.M., Novo M.T.M., Okura V.K., Oliveira M.C., Oliveira V.R.,
RA Pereira H.A., Rossi A., Sena J.A.D., Silva C., de Souza R.F.,
RA Spinola L.A.F., Takita M.A., Tamura R.E., Teixeira E.C., Tezza R.I.D.,
RA Trindade dos Santos M., Truffi D., Tsai S.M., White F.F.,
RA Setubal J.C., Kitajima J.P.;

RT "Comparison of the genomes of two Xanthomonas pathogens with differing 

RT host specificities."; 

RL Nature 417:459-463(2002). 

DR EMBL; AE012089; AAM39175.1; -. 

KW Complete proteome. 

SQ SEQUENCE 137 AA; 15312 MW; 940D07B0F5EBA043 CRC64;

Query Match 73.2%; Score 30; DB 16; Length 137;
Best Local Similarity 57.1%; Pred. No. 1.9e+02;
Matches 4; Conservative 3; Mismatches 0; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              ::||:||
Db         37 IYFADDF 43



RESULT 88 
Q81Z55 

ID Q81Z55 PRELIMINARY; PRT; 138 AA. 

AC Q81Z55; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE General stress protein 26. 

GN BA0418. 

OS Bacillus anthracis (strain Ames).

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 

OX NCBI_TaxID=198094; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22608414; PubMed=12721629;
RA Read T.D., Peterson S.N., Tourasse N., Baillie L.W., Paulsen I.T.,
RA Nelson K.E., Tettelin H., Fouts D.E., Eisen J.A., Gill S.R.,
RA Holtzapple E.K., Okstad O.A., Helgason E., Rilstone J., Wu M.,
RA Kolonay J.F., Beanan M.J., Dodson R.J., Brinkac L.M., Gwinn M.,
RA DeBoy R.T., Madpu R., Daugherty S.C., Durkin A.S., Haft D.H.,
RA Nelson W.C., Peterson J.D., Pop M., Khouri H.M., Radune D.,
RA Benton J.L., Mahamoud Y., Jiang L., Hance I.R., Weidman J.F.,
RA Berry K.J., Plaut R.D., Wolf A.M., Watkins K.L., Nierman W.C.,
RA Hazen A., Cline R., Redmond C., Thwaite J.E., White O., Salzberg S.L.,
RA Thomason B., Friedlander A.M., Koehler T.M., Hanna P.C., Kolsto A.-B.,
RA Fraser C.M.;

RT "The genome sequence of Bacillus anthracis Ames and comparison to 

RT closely related bacteria."; 

RL Nature 423:81-86(2003). 

DR EMBL; AE017025; AAP24447.1; 

DR TIGR; BA0418; 

DR InterPro; IPR009002; FMN_binding. 

KW Complete proteome. 

SQ SEQUENCE 138 AA; 16068 MW; 01A47C4A37870CF5 CRC64; 



Query Match 73.2%; Score 30; DB 16; Length 138;
Best Local Similarity 62.5%; Pred. No. 1.9e+02;
Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              ::|| |||
Db         32 MMFFHEDF 39



RESULT 89
Q81ID0 

ID Q81ID0 PRELIMINARY; PRT; 138 AA. 

AC Q81ID0; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE General stress protein 26. 

GN BC0461. 

OS Bacillus cereus (strain ATCC 14579 / DSM 31). 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 

OX NCBI_TaxID=226900;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=22608415; PubMed=12721630;
RA Ivanova N., Sorokin A., Anderson I., Galleron N., Candelon B.,
RA Kapatral V., Bhattacharyya A., Reznik G., Mikhailova N., Lapidus A.,
RA Chu L., Mazur M., Goltsman E., Larsen N., D'Souza M., Walunas T.,
RA Grechkin Y., Pusch G., Haselkorn R., Fonstein M., Ehrlich S.D.,
RA Overbeek R., Kyrpides N.;

RT "Genome sequence of Bacillus cereus and comparative analysis with 

RT Bacillus anthracis."; 

RL Nature 423:87-91(2003). 

DR EMBL; AE016999; AAP07501.1; 

DR InterPro; IPR009002; FMN_binding. 

KW Complete proteome. 

SQ SEQUENCE 138 AA; 15967 MW; 1C0516EA5B56DD92 CRC64; 

Query Match 73.2%; Score 30; DB 16; Length 138; 

Best Local Similarity 62.5%; Pred. No. 1.9e+02; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              ::|| |||
Db         32 MMFFHEDF 39



RESULT 90 
Q83U07 

ID Q83U07 PRELIMINARY; PRT; 153 AA. 

AC Q83U07; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Putative mating pair formation protein (MpfA).
GN MPFA.
OS Pseudomonas putida.
OG Plasmid pDTG1, and Plasmid pCg1.
OC Bacteria; Proteobacteria; Gammaproteobacteria; Pseudomonadales;

OC Pseudomonadaceae; Pseudomonas. 

OX NCBI_TaxID=303; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=NCIB 9816-4; PLASMID=pDTG1;
RA Zylstra G.J., Dennis J.J.;
RT "Complete nucleotide sequence of the NAH plasmid pDTG1 from
RT Pseudomonas putida NCIB 9816-4.";
RL Submitted (DEC-2002) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RC PLASMID=pCg1;
RA Park W., Jeon C.O., Hohnstock-Ashe A.M., Winans S.C., Zylstra G.J.,
RA Madsen E.L.;
RT "Identification and characterization of the conjugal transfer region
RT of the pCg1 plasmid from naphthalene-degrading Pseudomonas putida
RT Cg1.";
RL Submitted (MAR-2003) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AF491307; AAO64341.1;
DR EMBL; AY249147; AAP06659.1; -.
DR GO; GO:0046821; C:extrachromosomal DNA; IEA.
KW Plasmid.
SQ SEQUENCE 153 AA; 16823 MW; 97DF7896B474D643 CRC64;



Query Match 73.2%; Score 30; DB 2; Length 153; 

Best Local Similarity 75.0%; Pred. No. 2.1e+02; 

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



Qy          1 LVFFAEDF 8
              ||||| :|
Db        133 LVFFAPNF 140



RESULT 91
Q93RF3
ID Q93RF3 PRELIMINARY; PRT; 163 AA.
AC Q93RF3;
DT 01-DEC-2001 (TrEMBLrel. 19, Created)
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)
DT 01-DEC-2001 (TrEMBLrel. 19, Last annotation update)
DE Open reading frame.
GN ORF.



OS Treponema medium. 

OC Bacteria; Spirochaetes; Spirochaetales; Spirochaetaceae; Treponema.
OX NCBI_TaxID=58231;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=ATCC700293;
RA Fukunaga M.;
RT "A phylogenetic analysis of a human oral spirochete Treponema medium
RT by flagellar genes.";
RL Submitted (JUL-2000) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AB046578; BAB62253.1; -. 

SQ SEQUENCE 163 AA; 18137 MW; 777247A666916E0F CRC64; 

Query Match 73.2%; Score 30; DB 2; Length 163; 

Best Local Similarity 75.0%; Pred. No. 2.3e+02; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              |||| | |
Db        152 LVFFTEKF 159



RESULT 92 
Q92CD6 

ID Q92CD6 PRELIMINARY; PRT; 174 AA. 

AC Q92CD6; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created)
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE Hypothetical protein lin1236.
GN LIN1236.
OS Listeria innocua.
OC Bacteria; Firmicutes; Bacillales; Listeriaceae; Listeria.

OX NCBI_TaxID=1642; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=CLIP 11262 / Serovar 6a; 

RX MEDLINE=21537279; PubMed=11679669;
RA Glaser P., Frangeul L., Buchrieser C., Rusniok C., Amend A.,
RA Baquero F., Berche P., Bloecker H., Brandt P., Chakraborty T.,
RA Charbit A., Chetouani F., Couve E., de Daruvar A., Dehoux P.,
RA Domann E., Dominguez-Bernal G., Duchaud E., Durant L., Dussurget O.,
RA Entian K.-D., Fsihi H., Garcia-del Portillo F., Garrido P.,
RA Gautier L., Goebel W., Gomez-Lopez N., Hain T., Hauf J., Jackson D.,
RA Jones L.-M., Kaerst U., Kreft J., Kuhn M., Kunst F., Kurapkat G.,
RA Madueno E., Maitournam A., Mata Vicente J., Ng E., Nedjari H.,
RA Nordsiek G., Novella S., de Pablos B., Perez-Diaz J.-C., Purcell R.,
RA Remmel B., Rose M., Schlueter T., Simoes N., Tierrez A.,
RA Vazquez-Boland J.-A., Voss H., Wehland J., Cossart P.;
RT "Comparative genomics of Listeria species.";
RL Science 294:849-852(2001).

DR EMBL; AL596168; CAC96467.1; 

DR PIR; AC1587; AC1587. 

DR ListiList; LIN01236; -. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 174 AA; 19975 MW; B18BC9ADB185EE53 CRC64;

Query Match 73.2%; Score 30; DB 16; Length 174;
Best Local Similarity 62.5%; Pred. No. 2.4e+02;
Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              |:||| |:
Db        145 LLFFASDY 152



RESULT 93
O64061
ID O64061 PRELIMINARY; PRT; 216 AA.
AC O64061;
DT 01-AUG-1998 (TrEMBLrel. 07, Created)
DT 01-AUG-1998 (TrEMBLrel. 07, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Hypothetical protein.
GN YOMX.
OS Bacteriophage SPBc2.
OC Viruses; dsDNA viruses, no RNA stage; Caudovirales; Siphoviridae.
OX NCBI_TaxID=66797;
RN [1]
RP SEQUENCE FROM N.A.
RA Lazarevic V., Duesterhoeft A., Soldo B., Hilbert H., Mauel C.,
RA Karamata D.;
RT "The complete nucleotide sequence of the Bacillus subtilis SPbetac2
RT prophage.";
RL Submitted (AUG-1997) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AF020713; AAC13021.1;
DR PIR; T12812; T12812.
KW Hypothetical protein.
SQ SEQUENCE 216 AA; 24179 MW; FDD0ED46D7A5DB82 CRC64;



Query Match 73.2%; Score 30; DB 9; Length 216;
Best Local Similarity 62.5%; Pred. No. 3.1e+02;
Matches 5; Conservative 1; Mismatches 2; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              | || ||:
Db        180 LTFFGEDY 187



RESULT 94 
O31960

ID O31960 PRELIMINARY; PRT; 216 AA.
AC O31960;
DT 01-JAN-1998 (TrEMBLrel. 05, Created)
DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE YOMX protein.
GN YOMX.

OS Bacillus subtilis. 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 

OX NCBI_TaxID=1423; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=168; 



RX MEDLINE=98044033; PubMed=9384377;
RA Kunst F., Ogasawara N., Moszer I., Albertini A.M., Alloni G.,
RA Azevedo V., Bertero M.G., Bessieres P., Bolotin A., Borchert S.,
RA Borriss R., Boursier L., Brans A., Braun M., Brignell S.C., Bron S.,
RA Brouillet S., Bruschi C.V., Caldwell Capuano V., Carter N.M.,
RA Choi S.K., Codani J.J., Connerton I.F., Cummings N.J., Daniel R.A.,
RA Denizot F., Devine K.M., Dusterhoft A., Ehrlich S.D., Emmerson P.T.,
RA Entian K.D., Errington J., Fabret C., Ferrari E., Foulger D.,
RA Fritz C., Fujita M., Fujita Y., Fuma S., Galizzi A., Galleron N.,
RA Ghim S.Y., Glaser P., Goffeau A., Golightly E.J., Grandi G.,
RA Guiseppi G., Guy B.J., Haga K., Haiech J., Harwood C.R., Henaut A.,
RA Hilbert H., Holsappel S., Hosono S., Hullo M.F., Itaya M., Jones L.,
RA Joris B., Karamata D., Kasahara Y., Klaerr-Blanchard M., Klein C.,
RA Kobayashi Y., Koetter P., Koningstein G., Krogh S., Kumano M.,
RA Kurita K., Lapidus A., Lardinois S., Lauber J., Lazarevic V.,
RA Lee S.M., Levine A., Liu H., Masuda S., Mauel C., Medigue C.,
RA Medina N., Mellado R.P., Mizuno M., Moestl D., Nakai S., Noback M.,
RA Noone D., O'Reilly M., Ogawa K., Ogiwara A., Oudega B., Park S.H.,
RA Parro V., Pohl T.M., Portetelle D., Porwollik S., Prescott A.M.,
RA Presecan E., Pujic P., Purnelle B., Rapoport G., Rey M., Reynolds S.,
RA Rieger M., Rivolta C., Rocha E., Roche B., Rose M., Sadaie Y.,
RA Sato T., Scanlan E., Schleich S., Schroeter R., Scoffone F.,
RA Sekiguchi J., Sekowska A., Seror S.J., Serror P., Shin B.S., Soldo B.,
RA Sorokin A., Tacconi E., Takagi T., Takahashi H., Takemaru K.,
RA Takeuchi M., Tamakoshi A., Tanaka T., Terpstra P., Tognoni A.,
RA Tosato V., Uchiyama S., Vandenbol M., Vannier F., Vassarotti A.,
RA Viari A., Wambutt R., Wedler E., Wedler H., Weitzenegger T.,
RA Winters P., Wipat A., Yamamoto H., Yamane K., Yasumoto K., Yata K.,
RA Yoshida K., Yoshikawa H.F., Zumstein E., Yoshikawa H., Danchin A.;
RT "The complete genome sequence of the Gram-positive bacterium Bacillus
RT subtilis.";
RL Nature 390:249-256(1997).

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=168; 

RA Kunst F., Ogasawara N., Yoshikawa H., Danchin A.; 

RL Submitted (NOV-1997) to the EMBL/GenBank/DDBJ databases.
DR EMBL; Z99115; CAB14037.1;
KW Complete proteome.
SQ SEQUENCE 216 AA; 24179 MW; FDD0ED46D7A5DB82 CRC64;

Query Match 73.2%; Score 30; DB 16; Length 216;
Best Local Similarity 62.5%; Pred. No. 3.1e+02;
Matches 5; Conservative 1; Mismatches 2; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              | || ||:
Db        180 LTFFGEDY 187



RESULT 95
O16760

ID O16760 PRELIMINARY; PRT; 222 AA.
AC O16760;
DT 01-JAN-1998 (TrEMBLrel. 05, Created)
DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)



DE Hypothetical protein. 

GN F59E11.6. 

OS Caenorhabditis elegans. 

OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea; 

OC Rhabditidae; Peloderinae; Caenorhabditis. 

OX NCBI_TaxID=6239; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Bristol N2;
RX MEDLINE=99069613; PubMed=9851916;
RA None;
RT "Genome sequence of the nematode C. elegans: a platform for
RT investigating biology. The C. elegans Sequencing Consortium.";
RL Science 282:2012-2018(1998).
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=Bristol N2;
RA Bradshaw H.;
RT "The sequence of C. elegans cosmid F59E11.";
RL Submitted (AUG-1997) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP SEQUENCE FROM N.A.
RC STRAIN=Bristol N2;
RA Waterston R.;
RT "Direct Submission.";
RL Submitted (AUG-2001) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF016685; AAG24142.1; -. 

DR PIR; T32121; T32121. 

DR WormPep; F59E11.6; CE11506. 

DR InterPro; IPR002542; DUF19. 

DR Pfam; PF01579; DUF19; 1. 

KW Hypothetical protein.
SQ SEQUENCE 222 AA; 25570 MW; B8DC0919A0BDA564 CRC64;

Query Match 73.2%; Score 30; DB 5; Length 222;
Best Local Similarity 83.3%; Pred. No. 3.1e+02;
Matches 5; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy          3 FFAEDF 8
              ||:|||
Db        119 FFSEDF 124



RESULT 96 
Q8ZAM4 

ID Q8ZAM4 PRELIMINARY; PRT; 258 AA. 

AC Q8ZAM4;
DT 01-MAR-2002 (TrEMBLrel. 20, Created)
DT 01-MAR-2002 (TrEMBLrel. 20, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Sec-independent protein translocase protein TatC (Hypothetical
DE protein).
GN TATC OR YPO3776 OR Y0454.
OS Yersinia pestis.
OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales;
OC Enterobacteriaceae; Yersinia.
OX NCBI_TaxID=632;



RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=CO-92 / Biovar Orientalis;
RX MEDLINE=21470413; PubMed=11586360;
RA Parkhill J., Wren B.W., Thomson N.R., Titball R.W., Holden M.T.G.,
RA Prentice M.B., Sebaihia M., James K.D., Churcher C., Mungall K.L.,
RA Baker S., Basham D., Bentley S.D., Brooks K., Cerdeno-Tarraga A.M.,
RA Chillingworth T., Cronin A., Davies R.M., Davis P., Dougan G.,
RA Feltwell T., Hamlin N., Holroyd S., Jagels K., Karlyshev A.V.,
RA Leather S., Moule S., Oyston P.C.F., Quail M., Rutherford K.,
RA Simmonds M., Skelton J., Stevens K., Whitehead S., Barrell B.G.;
RT "Genome sequence of Yersinia pestis, the causative agent of plague.";
RL Nature 413:523-527(2001).
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=KIM5 / Biovar Mediaevalis;
RX MEDLINE=22137863; PubMed=12142430;
RA Deng W., Burland V., Plunkett G. III, Boutin A., Mayhew G.F., Liss P.,
RA Perna N.T., Rose D.J., Mau B., Zhou S., Schwartz D.C.,
RA Fetherston J.D., Lindler L.E., Brubaker R.R., Plano G.V.,
RA Straley S.C., McDonough K.A., Nilles M.L., Matson J.S., Blattner F.R.,
RA Perry R.D.;
RT "Genome sequence of Yersinia pestis KIM.";
RL J. Bacteriol. 184:4601-4611(2002).

DR EMBL; AJ414158; CAC93243.1; -. 

DR EMBL; AE013646; AAM84043.1; -. 

DR PIR; AG0459; AG0459. 

DR InterPro; IPR008277; Tat_C. 

DR InterPro; IPR002033; Translocase. 

DR Pfam; PF00902; TatC; 1. 

DR TIGRFAMs; TIGR00945; tatC; 1. 

DR PROSITE; PS01218; TATC; 1. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 258 AA; 28725 MW; 7B052D8306E6FDA6 CRC64; 

Query Match 73.2%; Score 30; DB 16; Length 258; 

Best Local Similarity 85.7%; Pred. No. 3.7e+02; 

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAED 7
              ||||| |
Db         34 LVFFAND 40



RESULT 97
Q8CUH9
ID Q8CUH9 PRELIMINARY; PRT; 282 AA.
AC Q8CUH9;
DT 01-MAR-2003 (TrEMBLrel. 23, Created)
DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE Spore coat polysaccharide synthesis (dTDP-4-dehydrorhamnose
DE reductase).
GN OB1128.
OS Oceanobacillus iheyensis.
OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Oceanobacillus.
OX NCBI_TaxID=182710;



RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=HTE831 / DSM 14371 / JCM 11309;

RX MEDLINE=22220767; PubMed=12235376; 

RA Takami H., Takaki Y., Uchiyama I.; 

RT "Genome sequence of Oceanobacillus iheyensis isolated from the Iheya 

RT Ridge and its unexpected adaptive capabilities to extreme 

RT environments.";
RL Nucleic Acids Res. 30:3927-3935(2002).
DR EMBL; AP004596; BAC13084.1; -.
DR GO; GO:0008831; F:dTDP-4-dehydrorhamnose reductase activity; IEA.
DR GO; GO:0045226; P:extracellular polysaccharide biosynthesis; IEA.
DR InterPro; IPR005913; TDP_rham_reduct.

DR Pfam; PF04321; RmlD_sub_bind; 1. 

DR TIGRFAMs; TIGR01214; rmlD; 1. 

KW Complete proteome. 

SQ SEQUENCE 282 AA; 32214 MW; F53A8BE4CCD17075 CRC64;

Query Match 73.2%; Score 30; DB 16; Length 282;
Best Local Similarity 62.5%; Pred. No. 4e+02;
Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              ||||: |:
Db         99 LVFFSSDY 106



RESULT 98
Q838Z8 

ID Q838Z8 PRELIMINARY; PRT; 311 AA. 

AC Q838Z8; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created)

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Cation efflux family protein. 

GN EF0280. 

OS Enterococcus faecalis (Streptococcus faecalis).
OC Bacteria; Firmicutes; Lactobacillales; Enterococcaceae; Enterococcus.

OX NCBI_TaxID=1351; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=V583 / ATCC 700802; 

RX MEDLINE=22550857; PubMed=12663927;
RA Paulsen I.T., Banerjei L., Myers G.S.A., Nelson K.E., Seshadri R.,
RA Read T.D., Fouts D.E., Eisen J.A., Gill S.R., Heidelberg J.F.,
RA Tettelin H., Dodson R.J., Umayam L., Brinkac L., Beanan M.,
RA Daugherty S., DeBoy R.T., Durkin S., Kolonay J., Madupu R., Nelson W.,
RA Vamathevan J., Tran B., Upton J., Hansen T., Shetty J., Khouri H.,
RA Utterback T., Radune D., Ketchum K.A., Dougherty B.A., Fraser C.M.;

RT "Role of mobile DNA in the evolution of vancomycin-resistant 

RT Enterococcus faecalis."; 

RL Science 299:2071-2074(2003). 

DR EMBL; AE016947; AAO80143.1; 

DR TIGR; EF0280; 

DR GO; GO:0016020; C:membrane; IEA.
DR GO; GO:0008324; F:cation transporter activity; IEA.
DR GO; GO:0006812; P:cation transport; IEA.
DR InterPro; IPR002524; Cation_efflux.
DR Pfam; PF01545; Cation_efflux; 1.
DR TIGRFAMs; TIGR01297; CDF; 1.
KW Complete proteome.
SQ SEQUENCE 311 AA; 33897 MW; E2C76C9D508BA875 CRC64;

Query Match 73.2%; Score 30; DB 16; Length 311;
Best Local Similarity 62.5%; Pred. No. 4.4e+02;
Matches 5; Conservative 1; Mismatches 2; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              |: | |||
Db        163 LIIFTEDF 170



RESULT 99
Q9KE67
ID Q9KE67 PRELIMINARY; PRT; 327 AA.
AC Q9KE67;
DT 01-OCT-2000 (TrEMBLrel. 15, Created)
DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE ABC transporter (Substrate-binding protein).
GN BH0990.
OS Bacillus halodurans.
OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus.
OX NCBI_TaxID=86665;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=C-125 / JCM 9153;
RX MEDLINE=20512582; PubMed=11058132;
RA Takami H., Nakasone K., Takaki Y., Maeno G., Sasaki R., Masui N.,
RA Fuji F., Hirama C., Nakamura Y., Ogasawara N., Kuhara S.,
RA Horikoshi K.;
RT "Complete genome sequence of the alkaliphilic bacterium Bacillus
RT halodurans and genomic sequence comparison with Bacillus subtilis.";
RL Nucleic Acids Res. 28:4317-4331(2000).
DR EMBL; AP001510; BAB04709.1;
DR PIR; F83773; F83773.
DR GO; GO:0005381; F:iron ion transporter activity; IEA.
DR GO; GO:0006827; P:high affinity iron ion transport; IEA.
DR InterPro; IPR002491; Peripla_BP.
DR InterPro; IPR000437; Prok_lipoprot_S.
DR Pfam; PF01497; Peripla_BP_2; 1.
DR PROSITE; PS00013; PROKAR_LIPOPROTEIN; 1.
KW Complete proteome.
SQ SEQUENCE 327 AA; 36362 MW; 8254A0BF90873F17 CRC64;

Query Match 73.2%; Score 30; DB 16; Length 327;
Best Local Similarity 71.4%; Pred. No. 4.7e+02;
Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy          2 VFFAEDF 8
              :| ||||
Db        263 IFLAEDF 269



RESULT 100 
Q803V8 

ID Q803V8 PRELIMINARY; PRT; 343 AA. 

AC Q803V8; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Similar to MO25 protein.
OS Brachydanio rerio (Zebrafish) (Danio rerio).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;

OC Cyprinidae; Danio. 

OX NCBI_TaxID=7955; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. ; 

RL Submitted (JAN-2003) to the EMBL/GenBank/DDBJ databases.

DR EMBL; BC044172; AAH44172.1; 

DR InterPro; IPR008938; ARM. 

DR InterPro; IPR004892; Mo25. 

DR Pfam; PF03204; Mo25; 1. 

SQ SEQUENCE 343 AA; 39820 MW; F10450DA0446268A CRC64;

Query Match 73.2%; Score 30; DB 13; Length 343;
Best Local Similarity 62.5%; Pred. No. 4.9e+02;
Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy          1 LVFFAEDF 8
              :| |:|||
Db        160 IVLFSEDF 167

Search completed: February 28, 2004, 08:56:41
Job time : 112.5 secs



CC diminishes or stops. Has also significant hyperglycemic hormone
CC (CHH) activity.
CC -!- SUBCELLULAR LOCATION: Secreted.
CC -!- MASS SPECTROMETRY: MW=9193.3; MW_ERR=0.53; METHOD=Electrospray.
CC -!- SIMILARITY: Belongs to the arthropod CHH/MIH/GIH/VIH hormone
CC family.
DR InterPro; IPR001166; CHH_MIH_GIH.
DR Pfam; PF01147; Crust_neurohorm; 1.
DR PRINTS; PR00550; HYPRGLYCEMIC.
DR PROSITE; PS01250; CHH_MIH_GIH; 1.
KW Neuropeptide; Hormone.



FT DISULFID 7 44 BY SIMILARITY.
FT DISULFID 24 40 BY SIMILARITY.
FT DISULFID 27 53 BY SIMILARITY.
SQ SEQUENCE 78 AA; 9200 MW; 7285FB5FC8BA98C6 CRC64;

Query Match 65.9%; Score 27; DB 1; Length 78;
Best Local Similarity 83.3%; Pred. No. 54;
Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          3 FFAEDF 8
              || |||
Db         45 FFNEDF 50



RESULT 100
MIH_CALSI
ID MIH_CALSI STANDARD; PRT; 113 AA.
AC P55321; Q23738;
DT 01-OCT-1996 (Rel. 34, Created)
DT 01-OCT-1996 (Rel. 34, Last sequence update)
DT 10-OCT-2003 (Rel. 42, Last annotation update)
DE Molt-inhibiting hormone precursor (MIH).
OS Callinectes sapidus (Blue crab).
OC Eukaryota; Metazoa; Arthropoda; Crustacea; Malacostraca;
OC Eumalacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura;
OC Eubrachyura; Portunoidea; Portunidae; Callinectes.
OX NCBI_TaxID=6763;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Eyestalk;
RX MEDLINE=95251700; PubMed=7537497;
RA Lee K.J., Elton T.S., Bej A.K., Watts S.A., Watson R.D.;
RT "Molecular cloning of a cDNA encoding putative molt-inhibiting
RT hormone from the blue crab, Callinectes sapidus.";
RL Biochem. Biophys. Res. Commun. 209:1126-1131(1995).
CC -!- FUNCTION: Inhibits Y-organs where molting hormone (ecdysteroid)
CC is secreted. A molting cycle is initiated when MIH secretion
CC diminishes or stops (By similarity).
CC -!- SUBCELLULAR LOCATION: Secreted.
CC -!- SIMILARITY: Belongs to the arthropod CHH/MIH/GIH/VIH hormone
CC family.
CC --------------------------------------------------------------------------
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC --------------------------------------------------------------------------
DR EMBL; U19764; AAA69029.1; -.
DR PIR; JC4143; JC4143.
DR InterPro; IPR001166; CHH_MIH_GIH.
DR Pfam; PF01147; Crust_neurohorm; 1.
DR PRINTS; PR00550; HYPRGLYCEMIC.
DR PROSITE; PS01250; CHH_MIH_GIH; 1.
KW Signal; Neuropeptide; Hormone.
FT SIGNAL 1 35 POTENTIAL.
FT PEPTIDE 36 113 MOLT-INHIBITING HORMONE.
FT DISULFID 42 79 BY SIMILARITY.
FT DISULFID 59 75 BY SIMILARITY.
FT DISULFID 62 88 BY SIMILARITY.
SQ SEQUENCE 113 AA; 12850 MW; 86AF3AE512130A5C CRC64;

Query Match 65.9%; Score 27; DB 1; Length 113;
Best Local Similarity 83.3%; Pred. No. 78;
Matches 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          3 FFAEDF 8
              || |||
Db         80 FFNEDF 85

Search completed: February 28, 2004, 08:53:02
Job time : 31.5 secs



GenCore version 5.1.6
Copyright (c) 1993 - 2004 Compugen Ltd.



OM protein - protein search, using sw model 



Run on:         February 28, 2004, 08:36:39 ; Search time 60.5 Seconds
                (without alignments)
                41.721 Million cell updates/sec



Title: US-09-668-314C-84 

Perfect score: 41 

Sequence: 1 LVFFAEDF 8 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 
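
The perfect score of 41 listed above is consistent with aligning the query LVFFAEDF against itself, without gaps, under BLOSUM62. A minimal Python sketch, assuming the standard BLOSUM62 self-match (diagonal) values for the six residue types in the query; the names `BLOSUM62_DIAGONAL` and `perfect_score` are illustrative and are not part of GenCore:

```python
# Standard BLOSUM62 self-match (diagonal) scores, restricted to the residue
# types that occur in the query. Illustrative subset, not the full matrix.
BLOSUM62_DIAGONAL = {"L": 4, "V": 4, "F": 6, "A": 4, "E": 5, "D": 6}

def perfect_score(query: str) -> int:
    """Score of the query aligned to itself with no gaps (hypothetical helper)."""
    return sum(BLOSUM62_DIAGONAL[residue] for residue in query)

print(perfect_score("LVFFAEDF"))  # 4 + 4 + 6 + 6 + 4 + 5 + 6 + 6 = 41
```

Because the self-alignment has no gaps, the Gapop and Gapext penalties do not enter this figure.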



Searched:       1017041 seqs, 315518202 residues
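
The reported throughput can be cross-checked from the figures in this header. The sketch below assumes that one "cell update" is one query residue compared against one database residue (a full dynamic-programming matrix per sequence); that interpretation and the function name are assumptions, not something the report states:

```python
def million_cell_updates_per_sec(query_len: int, db_residues: int, seconds: float) -> float:
    """Throughput in millions of DP cells per second (hypothetical helper)."""
    cells = query_len * db_residues  # one cell per query/database residue pair
    return cells / seconds / 1e6

# Query length 8, 315518202 database residues, 60.5 s search time.
rate = million_cell_updates_per_sec(8, 315518202, 60.5)
print(round(rate, 3))  # 41.721
```

Under that assumption the result agrees with the 41.721 Million cell updates/sec printed in the run header.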



Total number of hits satisfying chosen parameters:      1017041



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 1000 summaries



Database :      SPTREMBL_25:*
                1:  sp_archea:*
                2:  sp_bacteria:*
                3:  sp_fungi:*
                4:  sp_human:*
                5:  sp_invertebrate:*
                6:  sp_mammal:*
                7:  sp_mhc:*
                8:  sp_organelle:*
                9:  sp_phage:*
                10:  sp_plant:*
                11:  sp_rodent:*
                12:  sp_virus:*
                13:  sp_vertebrate:*
                14:  sp_unclassified:*
                15:  sp_rvirus:*
                16:  sp_bacteriap:*
                17:  sp_archeap:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
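
Unlike Pred. No., which depends on the score distribution of the whole search, the Query Match and Best Local Similarity percentages printed for each result can be reproduced from the other reported figures. The sketch below is inferred from the numbers in this report rather than from GenCore documentation: it assumes Query Match is the score divided by the perfect score, and Best Local Similarity is the number of matches divided by the aligned columns (matches, conservative substitutions and mismatches; the alignments shown here contain no indels). Function names are illustrative:

```python
PERFECT_SCORE = 41  # "Perfect score" from the run header

def query_match_pct(score: int, perfect: int = PERFECT_SCORE) -> float:
    """Score as a percentage of the perfect score (assumed definition)."""
    return round(100 * score / perfect, 1)

def best_local_similarity_pct(matches: int, conservative: int, mismatches: int) -> float:
    """Identical positions as a percentage of aligned columns (assumed definition)."""
    aligned = matches + conservative + mismatches
    return round(100 * matches / aligned, 1)

print(query_match_pct(31))                 # 75.6
print(query_match_pct(30))                 # 73.2
print(best_local_similarity_pct(5, 2, 1))  # 62.5
print(best_local_similarity_pct(5, 2, 0))  # 71.4
```

These values agree with the Query Match and Best Local Similarity lines reported for the individual results.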



SUMMARIES 



% 

Result Query 

No. Score Match Length DB 



ID 



Description 



1 


35 


85. 


4 


28 


4 


Q9UCD1 


Q9ucdl homo sapien 


2 


35 


85. 


4 


30 


4 


Q9UCA9 


Q9uca9 homo sapien 


3 


35 


85. 


4 


33 


4 


Q9UC33 


Q9uc33 homo sapien 


4 


35 


85. 


4 


79 


11 


035463 


035463 cricetulus 


5 


35 


85 . 


4 


82 


4 


Q16020 
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68. 


3 


504 


16 


Q87PF0 


Q87pf0 vibrio para 


443 


28 


68. 


3 


509 


10 


Q9C8 J7 


Q9c8j7 arabidopsis 


444 


28 


68. 


3 


513 


16 


Q8EMQ1 


Q8emql oceanobacil 


445 


28 


68. 


3 


516 


2 


Q8VMG2 


Q8vmq2 pseudomonas 


446 


28 


68. 


3 


518 


16 


Q9RKZ7 


Q9rkz7 streptomyce 


447 


28 


68. 


3 


523 


2 


P95618 


P95618 rhodocyclus 


448 


28 


68. 


3 


525 


2 


Q9JPB5 


Q9jpb5 rhodocyclus 


449 


28 


68. 


3 


529 


10 


Q9FMV0 


Q9fmv0 arabidopsis 


450 


28 


68. 


3 


535 


16 


Q926G8 


Q926g8 rhizobium m


451 


28 


68. 


3 


537 


2 


P96429 


P96429 rhizobium m 


452 


28 


68. 


3 


541 


16 


Q7WPA4 


Q7wpa4 bordetella 


453 


28 


68. 


3 


551 


16 


Q7UDT0 


Q7udt0 shigella fl 


454 


28 


68. 


3 


552 


16 


Q8XA02 


Q8xa02 escherichia 


455 


28 


68. 


3 


554 


2 


Q8RTF6 


Q8rtf6 oenococcus 


456 


28 


68. 


3 


554 


10 


Q9M150 


Q9ml50 arabidopsis 



457 


28 


68. 


3 


555 


2 


Q8RTF7 


Q8rtf7 oenococcus 


458 


28 


68. 


3 


555 


16 


Q97E08 


Q97e08 Clostridium 


459 


28 


68. 


3 


558 


16 


Q8FL80


Q8fl80 escherichia 


460 


28 


68. 


3 


558 


16 


Q83SP3 


Q83sp3 shigella fl 


461 


28 


68. 


3 


559 


10 


004614 


004614 arabidopsis 


462


28 


68. 


3 


559 


16 


083432 


083432 treponema p 


463 


28 


68. 


3 


561 


10 


Q9LTJ0 


Q9ltj0 arabidopsis


464 


28 


68. 


3 


564 


16 


Q89R39 


Q89r39 bradyrhizob 


465 


28 


68.


3 


574 


16 


Q8CVD8 


Q8cvd8 leptospira 


466 


28 


68.


3 


590 


8 


Q34192 


Q34192 crithidia o 


467 


28 


68.


3 


590 


10 


Q9STC7 


Q9stc7 dunaliella 


468 


28 


68.


3 


594 


17 


Q8TIC1 


Q8ticl methanosarc 


469 


28 


68.


3 


600 


9 


Q8LTT3


Q8ltt3 bacteriopha


470 


28 


68.


3 


600 


9 


Q9AZC9


Q9azc9 bacteriopha 


471 


28 


68.


3 


600 


16 


Q8YJX0


Q8yjx0 anabaena sp 


472 


28 


68.


3 


623 


16 


Q9PJ67


Q9pj67 campylobact


473 


28 


68.


3 


628 


5 


Q86N93 


Q86n93 nephila ina 


474 


28 


68.


3 


631 


16 


Q7UZQ2 


Q7uzq2 prochloroco 


475 


28 


68.


3 


634 


16 


Q7V9R2


Q7v9r2 prochloroco 


476


28


68.


3


639


16


Q8XP76


Q8xp76 Clostridium 


477


28 


68.


3 


640 


16 


Q8XKR0 


Q8xkr0 Clostridium 


478 


28 


68.


3 


646 


4 


Q8NEJ2 


Q8nej2 homo sapien



479 


28 


68.


3 


662 


16 


Q8D762


Q8d762 vibrio vuln


480


28


68.


3 


670 


4 


Q9Y6R3 


Q9y6r3 homo sapien 


481 



28 


68.


3 


695 



13 


Q98SF9


Q98sf9 xenopus lae 


482


28


68.


3 


695 


13 


Q7ZXQ0


Q7zxq0 xenopus lae 


483 


28 


68.


3 


698 


11 


080ZR9 


08azr9 mus musculu 


484 


28 


68.


3 


701 


16 


Q89XI8


Q89xi8 bradyrhizob 


485


28


68.


3 


703 


5 


09G056 


09ctq56 corcvra ceo 


486


28


68.


3 


704 


11 


Q9DA98


Q9da98 mus musculu


487 


28 


68.


3 


709 


11 


Q99P25 


Q99p25 mus musculu 


488 


28 


68.


3 


721 


12 


09YY01 


09vvo:l avian adeno 


489 


28 


68.


3 


728 


10 


Q8S0A6


Q8s0a6 oryza sativ 


490


28


68.


3 


752 


5 


Q25518


Q25518 manduca sex


491 


28 


68.


3 


752 


5 


Q25517


Q25517 manduca sex 


492 


28 


68.


3 


752 


5 


Q25516 


Q25516 manduca sex 


493 


28 


68.


3 


757 


5 


Q960E5 


Q960e5 drosophila 


494 


28 


68.


3 


760 


16 


Q89XI7 


Q89xi7 bradyrhizob 


495 


28 


68.


3 


761 


5 


Q9W557


Q9w557 drosophila 


496 


28 


68.


3 


765 


5 


Q963T0


Q963t0 plodia inte 


497 


28 


68.


3 


876 


16 


Q8XIV2 


Q8xiv2 Clostridium 


498 


28 


68.


3 


934 


16 


Q88WB1 


Q88wbl lactobacill 


499 



28 


68.


3 


995 


4 


Q9NRZ1


Q9nrzl homo sapien 


500 



28 


68.


3 


1020 


2 


Q9F7M0 


Q9f7m0 gamma-prote


501 


28 


68. 


3 


1027 


16 


Q8Z028 


Q8z028 anabaena sp 


502 


28 


68. 


3 


1035 


4 


015153 


015153 homo sapien 


503 


28 


68. 


3 


1035 


6 


097915 


097915 oryctolagus 


504 


28 


68. 


3 


1035 


11 


Q9R1C4 


Q9rlc4 mus musculu 


505 


28 


68. 


3 


1035 


11 


035422 


035422 rattus norv 


506 


28 


68.


3 


1035 


11 


054815 


054815 rattus norv 


507 


28 


68. 


3 


1035 


13 


013134 


013134 ambystoma t 


508 


28 


68. 


3 


1079 


4 


Q9Y6R1 


Q9y6rl homo sapien 


509 


28 


68. 


3 


1079 


4 


Q9UIC0 


Q9uic0 homo sapien 


510 


28 


68. 


3 


1079 


4 


Q9UIC1 


Q9uicl homo sapien 


511 


28 


68. 


3 


1079 


4 


Q9H262 


Q9h262 homo sapien


512 


28 


68. 


3 


1079 


4 


Q9UP50 


Q9up50 homo sapien 


513 


28 


68. 


3 


1079 


6 


Q9GL77 


Q9gl77 bos taurus 



514 


28 


68. 


3 


1079 


6 


Q9XSZ4 


Q9xsz4 oryctolagus 


515 


28 


68. 


3 


1079 


11 


Q9QXH6 


Q9qxh6 rattus norv 


516 


28 


68. 


3 


1079 


11 


088343 


088343 mus musculu


517 


28 


68. 


3 


1079 


11 


Q9JI66 


Q9ii66 rattus norv 


518 


28 


68. 


3 


1079 


11 


Q9QYA6 



Q9qya6 rattus norv 


519 


28 


68. 


3 


1088 


10 


Q9C9D7 


Q9c9d7 arabidopsis


520 


28 


68. 


3 


1094 


11 


Q9JJ32 


Q9ji32 rattus norv 


521 


28 


68. 


3 


1146 


5 


Q8WSX5 


Q8wsx5 Paramecium 


522 


28 


68. 


3 


1150 


5 


Q9W0H4 


Q9w0h4 drosophila 


523 


28 


68. 


3 


1160 


5 


Q27829 


Q27829 Paramecium 


524 


28 


68. 


3 


1200 


5 


Q8T5I7 


Q8t5i7 anopheles g


525 


28 


68. 


3 


1251 


5 


016568 


016568 caenorhabdi 


526 


28 


68. 


3 


1405 


5 


Q9N694 


Q9n694 toxoplasma 


527 


28 


68. 


3 


1450 


11 


054728 


054728 rattus norv 


528 


28 


68. 


3 


1653 


2 


Q9LBG3 


Q91bg3 streptococc 


529 


28 


68. 


3 


1721 


5 


Q9UAR3 



Q9uar3 pacifastacu 


530 


28 


68. 


3 


2400 


5 


Q8I3U7 


Q8i3u7 Plasmodium 


531 


28 


68. 


3 


4199 


16 


P74440 


P74440 synechocyst 


532 


28 


68. 


3 


5687 


5 


Q8IJH4 


Q8iih4 Plasmodium 


533 


28 


68. 


3 


5910 


5 


Q8IAP1 


Q8iapl plasmodium 


534 


27.5 


67. 


1 


516 


16 


Q9K8P6 


Q9k8p6 bacillus ha 


535 


27.5 


67. 


1 


516 


16 


Q8RAJ4 


Q8rai4 thermoanaer 


536 


27 


65. 


9 


36 


16 


Q9KLP7 


Q9klp7 vibrio chol 


537 


27 


65. 


9 


53 


9 


O80171


O80171 streptococc 


538 


27 


65. 


9 


58 


17 


Q8ZV06 


Q8zv06 pyrobaculum


539 


27 


65. 


9 


61 


16 


Q9PEN8 


Q9pen8 xylella fas 


540 


27 


65. 


9 


61 


16 


Q8EJA5 


Q8eia5 shewanella 


541 


27 


65. 


9 


61 


16 


Q81GQ4 


Q81gq4 bacillus ce 



542 


27 


65. 


9 


75 


16 


051426 


051426 borrelia bu 


543 


27 


65. 


9 


79 


16 


Q8A8P7 


Q8a8p7 bacteroides 


544 


27 


65. 


9 


79 


16 


Q8A554 


Q8a554 bacteroides 


545 


27 


65. 


9 


81 


16 


Q9X0Z5 


Q9x0z5 thermotoga 


546 


27 


65. 


9 


82 


16 


Q8KB13 


Q8kbl3 chlorobium 


547 


27


65. 


9 


88 


16 


Q7UR07 


Q7ur07 rhodopirell 


548 


27 


65. 


9 


91 


12 


Q9DUB2 


Q9dub2 tt virus, o 


549 


27 


65. 


9 


92 


16 


Q9HWW8 


Q9hww8 pseudomonas 


550 


27 


65. 


9 


95 


16 


Q8ZA71 


Q8za71 yersinia pe 



551 


27 


65.


9 


100 


7 


O98034


O98034 homo sapien 


552 


27 


65. 


9 


101 


10 


Q9LIY8 


Q91iy8 oryza sativ 


553 


27 


65. 


9 


104 


5 


Q8T4J5 


Q8t4j5 scylla serr 


554 


27 


65. 


9 


104 


16 


Q8YTV9 


Q8ytv9 anabaena sp 


555 


27 


65. 


9 


105 


9 


Q7Y598 


Q7y598 bacteriopha 



556 


27 


65. 


9 


111 


16 


Q9ACQ0 


Q9acq0 streptomyce 


557 


27 


65. 


9 


113 


5 


Q9GYE8 


Q9gye8 cancer pagu 



558 


27 


65. 


9 


119 


16 


Q8D9V8 


Q8d9v8 vibrio vuln 


559 


27 


65. 


9 


120 


10 


Q8L4B5 


Q814b5 oryza sativ 


560 


27 


65. 


9 


122 


2 


Q7WX48 


Q7wx48 alcaligenes 


561 


27 


65. 


9 


122 


6 


O77630


O77630 bos taurus 


562 


27 


65. 


9 


128 


16 


Q9RST6 


Q9rst6 deinococcus 


563 


27 


65. 


9 


128 


16 


Q9I430 


Q9i430 pseudomonas 


564 


27 


65. 


9 


133 


16 


Q9ADJ5 


Q9adj5 streptomyce 


565 


27 


65. 


9 


136 


16 


Q987H0 


Q987h0 rhizobium 1 


566 


27 


65. 


9 


137 


16 


Q8Y6R3 


Q8y6r3 listeria mo 


567 


27 


65. 


9 


137 


16 


Q87LG2 


Q871g2 vibrio para 


568 


27 


65. 


9 


140 


9 


Q8SC78 


Q8sc78 stx2 conver 


569 


27 


65. 


9 


140 


9 


Q7Y384 


Q7y384 stx1 conver


570 


27 


65. 


9 


140 


9 


Q7Y2S3 


Q7y2s3 stx2 conver 



571 


27 


65. 


9 


140 


16 


Q9KUU1 


Q9kuul vibrio chol 


572 


27 


65. 


9 


140 


16 


Q8DCF3 


Q8dcf3 vibrio vuln 


573 


27 


65. 


9 


142 


2 


Q9RII2 


Q9rii2 streptococc 


574 


27 


65. 


9 


143 


16 


Q8XS44 


Q8xs44 ralstonia s 


575 


27 


65. 


9 


143 


16 


Q7WNA3 


Q7wna3 bordetella 


576 


27 


65. 


9 


143 


16 


Q7W0X9 


Q7w0x9 bordetella 


577 


27 


65. 


9 


146 


16 


Q99XW3 


Q99xw3 streptococc 


578 


27 


65. 


9 


146 


16 


Q8NZ92 


Q8nz92 streptococc 


579 


27 


65. 


9 


148 


5 


Q9VUG5 


Q9vug5 drosophila 


580 


27 


65. 


9 


149 


17 


051955 


051955 halobacteri 


581 


27 


65. 


9 


150 


16 


Q9RVS4 


Q9rvs4 deinococcus 


582 


27 


65. 


9 


153 


16 


Q99TR0 


Q99tr0 staphylococ 


583 


27 


65. 


9 


154 


16 


Q831M9 


Q831m9 enterococcu 


584 


27 


65. 


9 


155 


5 


Q8I0Z4 


Q8i0z4 caenorhabdi 


585 


27 


65. 


9 


158 


2 


Q49737 


Q49737 mycobacteri 


586 


27 


65. 


9 


158 


16 


Q98LD9 


Q981d9 rhizobium 1 


587 


27 


65. 


9 


159 


13 


Q8AYG7 


Q8ayg7 gallus gall


588 


27 


65. 


9 


160 


16 


Q99VY8 


Q99vy8 staphylococ


589 


27 


65. 


9 


160 


16 


Q8NXT0 


Q8nxt0 staphylococ 


590 


27 


65. 


9 


161 


2 


030617 


030617 myxococcus


591 


27 


65. 


9 


164 


2 


Q8GMX4 


Q8gmx4 methanotrop 


592 


27 


65. 


9 


164 


11 


Q8BHP4 


Q8bhp4 mus musculu 


593 


27 


65. 


9 


167 


16 


Q9PM12 


Q9pml2 campylobact 



594 


27 


65. 


9 


176 


2 


Q8RT79 


Q8rt79 listeria mo 


595 


27 


65. 


9 


186 


16 


055539 


055539 synechocyst


596 


27 


65. 


9 


186 


16 


Q92BN0 


Q92bn0 listeria in 


597 


27 


65. 


9 


186 


16 


Q8Y739 


Q8y739 listeria mo 


598 


27 


65. 


9 


187 


16 


Q81SY7 


Q81sy7 bacillus an 


599 


27 


65. 


9 


187 


16 


Q81FT3 


Q81ft3 bacillus ce 


600 


27 


65. 


9 


188 


16 


Q8F442 


Q8f442 leptospira 


601 


27 


65. 


9 


189 


5 


Q8IP62 


Q8ip62 drosophila 


602 


27 


65. 


9 


190 


17 


Q9HSH9 


Q9hsh9 halobacteri 


603 


27 


65. 


9 


193 


16 


Q8YL39 


Q8yl39 anabaena sp


604 


27 


65. 


9 


196 


5 


Q9VUM4 


Q9vum4 drosophila


605 


27 


65. 


9 


198 


2 


Q8GKA2 


Q8gka2 helicobacte 


606 


27 


65. 


9 


200 


11 


Q8CF80 


Q8cf80 mus musculu 


607 


27 


65. 


9 


203 


4 


Q9H3I4 


Q9h3i4 homo sapien 


608 


27 


65. 


9 


204 


16 


Q97IZ2 


Q97iz2 Clostridium 


609 


27 


65. 


9 


205 


16 


Q81SM2 


Q81sm2 bacillus an 


610 


27 


65. 


9 


215 


16 


Q87IG4 


Q87ig4 vibrio para 


611 


27 


65. 


9 


216 


11 


Q8BHN8 


Q8bhn8 mus musculu 


612 


27 


65. 


9 


218 


16 


Q89W59 


Q89w59 bradyrhizob 


613 


27 


65. 


9 


219 


16 


Q81FI4 


Q81fi4 bacillus ce 


614 


27 


65. 


9 


221 


17 


058889 


058889 pyrococcus 


615 


27 


65. 


9 


222 


16 


Q8PK19 


Q8pkl9 xanthomonas 


616 


27 


65. 


9 


224 


12 


Q9Q8L0 


Q9q810 myxoma viru 


617 


27 


65. 


9 


224 


12 


Q9Q8Y8 


Q9q8y8 shope fibro 


618 


27 


65. 


9 


224 


16 


Q9AC07 


Q9ac07 caulobacter 


619 


27 


65. 


9 


225 


9 


Q7Y4L0 


Q7y410 streptococc 


620 


27 


65. 


9 


228 


16 


Q97KT2 


Q97kt2 Clostridium 


621 


27 


65. 


9 


230 


2 


Q7X0C5 


Q7x0c5 gamma-prote 


622 


27 


65. 


9 


230 


4 


Q7Z5B1 


Q7z5bl homo sapien 


623 


27 


65. 


9 


230 


16 


Q9PHW2 


Q9phw2 campylobact 


624 


27 


65. 


9 


233 


5 


044644 


044644 caenorhabdi 


625 


27 


65. 


9 


234 


16 


Q99YB2 


Q99yb2 streptococc 


626 


27 


65. 


9 


235 


1 


P71164 


P71164 desulfuroco 


627 


27 


65. 


9 


236 


2 


Q9FB08 


Q9fb08 mycoplasma 



628 


27 


65. 


9 


236 


4 


Q8WUD3 


Q8wud3 homo sapien 


629 


27 


65. 


9 


236 


11 


Q8BMU2 


Q8bmu2 mus musculu 


630 


27 


65. 


9 


236 


11 


Q9D4V7 


Q9d4v7 mus musculu 


631 


27 


65. 


9 


239 


3 


Q92387 


Q92387 trigonopsis


632 


27 


65. 


9 


239 


16 


Q9RL46 


Q9rl46 streptomyce 


633 


27 


65. 


9 


240 


3 


093877 


093877 fusarium ox 


634 


27 


65. 


9 


240 


16 


Q822E0 


Q822e0 chlamydophi 


635 


27 


65. 


9 


242 


3 


Q04701 


Q04701 fusarium so 


636 


27 


65. 


9 


242 


4 


Q7Z2Y4 


Q7z2y4 homo sapien 


637 


27 


65. 


9 


245 


16 


Q9CKD2 


Q9ckd2 pasteurella 


638 


27 


65. 


9 


246 


16 


Q83L46 


Q83146 shigella fl 


639 


27 


65. 


9 


248 


16 


Q8ZG63 


Q8zg63 yersinia pe 


640 


27 


65. 


9 


249 


16 


Q8YL29 


Q8yl29 anabaena sp 


641 


27 


65. 


9 


253 


16 


Q83B98 


Q83b98 coxiella bu 


642 


27 


65. 


9 


254 


4 


Q7Z5B2 


Q7z5b2 homo sapien 


643 


27 


65. 


9 


255 


10 


Q8LFD3 


Q81fd3 arabidopsis 


644 


27 


65. 


9 


258 


10 


Q84SJ7 


Q84sj7 oryza sativ 


645 


27 


65. 


9 


258 


16 


Q8REW6 


Q8rew6 fusobacteri 


646 


27 


65. 


9 


258 


16 


Q8FBI6 


Q8fbi6 escherichia 


647 


27 


65. 


9 


258 


16 


Q8CT67 


Q8ct67 staphylococ 



648 


27 


65. 


9 


258 


16 


Q83PG7 


Q83pg7 shigella fl 


649 


27 


65. 


9 


260 


10 


Q7XHJ8 


Q7xhj8 lycopersico 


650 


27 


65. 


9 


264 


16 


Q97IX6 


Q97ix6 Clostridium 


651 


27 


65. 


9 


265 


16 


Q9Z9J1 


Q9z9j1 bacillus ha


652 


27 


65. 


9 


266 


5 


Q8SV83 


Q8sv83 encephalito 


653 


27 


65. 


9 


266 


5 


Q8ST65 


Q8st65 encephalito 


654 


27 


65. 


9 


266 


16 


Q8A3T0 


Q8a3t0 bacteroides 


655 


27 


65. 


9 


267 


5 


Q8SVF5 


Q8svf5 encephalito 


656 


27 


65. 


9 


270 


16 


P74439 


P74439 synechocyst 


657 


27 


65. 


9 


271 


5 


Q93319 


Q93319 caenorhabdi 


658 


27 


65. 


9 


272 


4 


Q7Z2S1 


Q7z2sl homo sapien 


659 


27 


65. 


9 


273 


16 


Q8YMB8 


Q8ymb8 anabaena sp 


660 


27 


65. 


9 


275 


16 


Q8F4P8 


Q8f4p8 leptospira 


661 


27 


65. 


9 


276 


5 


044646 


044646 caenorhabdi 


662 


27 


65. 


9 


276 


16 


Q8F0K4 


Q8f0k4 leptospira 


663 


27 


65. 


9 


276 


16 


Q8EUB1 


Q8eubl mycoplasma 


664 


27 


65. 


9 


281 


16 


Q8EUJ6 


Q8euj6 mycoplasma 


665 


27 


65. 


9 


281 


17 


029061 


029061 archaeoglob 


666 


27 


65. 


9 


282 


10 


Q9ZVG2 


Q9zvg2 arabidopsis 


667 


27 


65. 


9 


283 


5 


Q8SY96 


Q8sy96 drosophila 


668 


27 


65. 


9 


286 


5 


Q9U4F4 


Q9u4f4 drosophila 


669 


27 


65. 


9 


292 


17 


Q9Y9R9 


Q9y9r9 aeropyrum p 


670 


27 


65. 


9 


293 


4 


Q8IZ81 


Q8iz81 homo sapien 


671 


27 


65. 


9 


293 


10 


Q9LXR3 


Q91xr3 arabidopsis 


672 


27 


65. 


9 


293 


11 


Q8BHP5 


Q8bhp5 mus musculu 


673 


27 


65. 


9 


293 


11 


Q8BGF6 


Q8bgf6 mus musculu 


674 


27 


65. 


9 


298 


16 


Q8CUH2 


Q8cuh2 oceanobacil 


675 


27 


65. 


9 


299 


17 


Q97BC0 


Q97bc0 thermoplasm 


676 


27 


65. 


9 


300 


2 


Q9AEC8 


Q9aec8 lactobacill 


677 


27 


65. 


9 


300 


16 


Q7V283


Q7v283 prochloroco 


678 


27 


65. 


9 


308 


10 


Q8LD18 


Q81dl8 arabidopsis 


679 


27 


65. 


9 


309 


16 


Q7UV34 


Q7uv34 rhodopirell 


680 


27 


65. 


9 


310 


8 


063638 


063638 macrosteles 


681 


27 


65. 


9 


310 


8 


063630 


063630 dalbulus lo 


682 


27 


65. 


9 


310 


8 


063629 


063629 dalbulus gu 


683 


27 


65. 


9 


311 


13 


Q802Z6 


Q802z6 brachydanio 


684 


27 


65. 


9 


314 


16 


Q9K834 


Q9k834 bacillus ha 



685 


27 


65. 


9


315 


5 


Q7YYR2 


Q7yyr2 cryptospori 


686 


27 


65. 


9


315 


16 


Q99S53 


Q99s53 staphylococ 


687 


27 


65.


9


317 


2 


Q8RL22 


Q8rl22 providencia 


688 


27 


65. 


9 


317 


13 


Q7T180 


Q7tl80 gallus gall 


689 


27 


65. 


9


318


17


Q96Z66 


Q96z66 sulfolobus


690 


27 


65. 


9


319 


5 


Q8STM2 


Q8stm2 encephalito 


691 


27 


65. 


9 


320 


17 


Q97AV8 


Q97av8 thermoplasm 


692 


27 


65. 


9 


323 


5 


045475 


045475 caenorhabdi 


693 


27 


65. 


9 


323 


5 


062050 


062050 caenorhabdi 


694 


27 


65. 


9 


323 


16 


Q9CHJ5 


Q9chj5 lactococcus 


695 


27 


65. 


9 


324 


16 


Q8EAY2 


Q8eay2 shewanella 


696 


27 


65. 


9 


326 


2 


Q44403 


Q44403 agrobacteri 


697 


27 


65. 


9 


326 


16 


Q898Z4 


Q898z4 Clostridium 


698 


27 


65. 


9 


328 


10 


Q9FYF3 


Q9fyf3 arabidopsis


699 


27 


65. 


9 


333 


16 


Q7WMC2 


Q7wmc2 bordetella 


700 


27 


65. 


9 


337 


10 


Q8H110 


Q8hll0 arabidopsis 


701 


27 


65. 


9 


337 


16 


Q93N67 


Q93n67 coxiella bu 


702 


27 


65. 


9 


339 


10 


Q9SFZ7 


Q9sfz7 arabidopsis 


703 


27 


65. 


9 


340 


5 


Q869P7 


Q869p7 dictyosteli 


704 


27 


65. 


9 


345 


16 


Q8XPV7 


Q8xpv7 ralstonia s 


705 


27 


65. 


9 


347 


16 


Q92XW3 


Q92xw3 rhizobium m 


706 


27 


65. 


9 


350 


11 


Q923H4 


Q923h4 mus musculu


707 


27 


65. 


9 


358 


8 


Q9TLM2 


Q9tlm2 compsopogon 


708 


27 


65. 


9 


360 


2 


Q9KI41 


Q9ki41 agrobacteri 


709 


27 


65. 


9 


360 


2 


Q9ADZ7 


Q9adz7 agrobacteri 


710 


27 


65. 


9 


361 


16 


Q92TH6 


Q92th6 rhizobium m 


711 


27 


65. 


9 


363 


5 


017784 


017784 caenorhabdi 


712 


27 


65. 


9 


365 


12 


Q91TQ9 


Q91tq9 tupaia herp 


713 


27 


65. 


9 


366 


10 


Q9M7D5 


Q9m7d5 ipomoea bat 


714 


27 


65. 


9 


366 


10 


Q9AUC5 


Q9auc5 ipomoea bat 


715 


27 


65. 


9 


368 


3 


Q9UT60 


Q9ut60 schizosacch 


716 


27 


65. 


9 


368 


10 


Q9M7D6 


Q9m7d6 ipomoea bat 


717 


27 


65. 


9 


368 


10 


Q84RM9 


Q84rm9 ipomoea bat 


718 


27 


65. 


9 


368 


10 


Q84RM8 


Q84rm8 ipomoea bat 


719 


27 


65. 


9 


372 


8 


Q9THP4 


Q9thp4 erythroclad 


720 


27 


65. 


9 


372 


8 


Q9TLM6 


Q9tlm6 compsopogon 


721 


27 


65. 


9 


372 


8 


Q9TLM3 


Q9tlm3 erythrotric 


722 


27 


65. 


9 


372 


16 


Q8EPT3 


Q8ept3 oceanobacil 


723 


27 


65. 


9 


373 


8 


Q8WI59 


Q8wi59 compsopogon 


724 


27 


65. 


9 


374 


5 


Q9TXY9 


Q9txy9 caenorhabdi 


725 


27 


65. 


9 


374 


16 


Q927X3 


Q927x3 listeria in 


726 


27 


65. 


9 


375 


8 


Q9TLM4 


Q9tlm4 compsopogon 


727 


27 


65. 


9 


375 


16 


Q88BQ5 


Q88bq5 pseudomonas 


728 


27 


65. 


9 


376 


16 


Q97LY4 


Q971y4 Clostridium 


729 


27 


65. 


9 


380 


8 


Q9TLM5 


Q9tlm5 compsopogon 


730 


27 


65. 


9 


380 


16 


Q8E6X6 


Q8e6x6 streptococc 


731 


27 


65. 


9 


380 


16 


Q8E1F9 


Q8elf9 streptococc 


732 


27 


65. 


9 


382 


2 


Q9RM37 


Q9rm37 escherichia 


733 


27 


65. 


9 


382 


10 


Q9LHW0 


Q91hw0 oryza sativ 


734 


27 


65. 


9 


383 


5 


Q8I280 


Q8i280 Plasmodium


735 


27 


65. 


9 


383 


10 


Q9SKP8 


Q9skp8 arabidopsis 


736 


27 


65. 


9 


384 


8 


Q8LWZ3 


Q81wz3 laminaria d 


737 


27 


65. 


9 


384 


16 


Q8ESE3 


Q8ese3 oceanobacil 


738 


27 


65. 


9 


385 


16 


Q9AA68 


Q9aa68 caulobacter 


739 


27 


65. 


9 


389 


16 


Q89UG7


Q89ug7 bradyrhizob 


740 


27 


65. 


9 


390 


5 


Q9TXJ3 


Q9txj3 leishmania 


741 


27 


65. 


9 


390 


16 


Q8CMV2 


Q8cmv2 staphylococ 



742 


27 


65. 


9 


393 


10 


Q9ZWA1 


Q9zwal arabidopsis 


743 


27 


65. 


9 


393 


10 


Q8GZ44 


Q8gz44 arabidopsis 


744 


27 


65. 


9 


393 


16 


Q9A5N6 


Q9a5n6 caulobacter 


745 


27 


65. 


9 


394 


16 


P73566 


P73566 synechocyst 


746 


27 


65. 


9 


398 


5 


Q26004 


Q26004 Plasmodium


747 


27 


65. 


9 


398 


5 


Q9U433 


Q9u433 Plasmodium 


748 


27 


65. 


9 


398 


5 


Q9U432 


Q9u432 Plasmodium 


749 


27 


65. 


9 


398 


5 


Q9TVT9 


Q9tvt9 Plasmodium 


750 


27 


65. 


9 


398 


5 


Q8I484 


Q8i484 Plasmodium 


751 


27 


65. 


9 


398 


12 


Q8BA41 


Q8ba41 avian param 


752 


27 


65.


9 


399 


16 


Q9PN56 


Q9pn56 campylobact 


753 


27 


65. 


9 


400 


5 


Q8MUN1 


Q8munl Plasmodium 


754 


27 


65. 


9 


400 


5 


Q8I485 


Q8i485 Plasmodium


755 


27 


65. 


9 


400 


16 


Q8YKU6 


Q8yku6 anabaena sp 


756 


27 


65. 


9 


403 


5 


Q8T2W3 


Q8t2w3 trypanosoma 


757 


27 


65. 


9 


405 


8 


Q8MAA3 


Q8maa3 rhodochaete 


758 


27 


65. 


9 


405 


10 


Q9SXC4 


Q9sxc4 arabidopsis 


759 


27 


65. 


9 


406 


16 


Q82SE8 


Q82se8 nitrosomona 


760 


27 


65. 


9 


407 


16 


Q9ZDJ2 


Q9zdj2 rickettsia 


761 


27 


65. 


9 


409 


16 


Q7VNU4 


Q7vnu4 haemophilus 


762 


27 


65. 


9 


410 


16 


Q9RGT4 


Q9rgt4 salmonella 


763 


27 


65. 


9 


410 


17 


Q979W6 


Q979w6 thermoplasm 


764 


27 


65. 


9 


412 


10 


Q9SHS5 


Q9shs5 arabidopsis 


765 


27 


65. 


9 


414 


17 


Q8U2N0 


Q8u2n0 pyrococcus 


766 


27 


65. 


9 


415 


12 


P87751 


P87751 hepatitis c 



767 


27 


65.


9 


415 


16 


Q92S40 


Q92s40 rhizobium m 


768 


27 


65. 


9 


417 


16 


025454 


025454 helicobacte 


769 


27 


65. 


9 


418 


16 


Q8EHV4 


Q8ehv4 shewanella 


770 


27 


65. 


9 


420 


16 


Q9PEW9 


Q9pew9 xylella fas 


771 


27 


65. 


9 


420 


16 


Q87AP7 


Q87ap7 xylella fas 


772 


27 


65. 


9 


423 


3 


Q04429 


Q04429 saccharomyc


773 


27 


65. 


9 


424 


3 


O74508


O74508 schizosacch 


774 


27 


65. 


9 


428 


5 


Q8SXA2 


Q8sxa2 drosophila 


775 


27 


65. 


9 


430 


5 


Q17814 


Q17814 caenorhabdi 


776 


27 


65. 


9 


431 


8 


Q31955 


Q31955 compsopogon 


777


27 


65. 


9 


431 


16 


066787 


066787 aquifex aeo 


778


27 


65. 


9 


438 


5 


Q9VNC2 


Q9vnc2 drosophila 


779 


27 


65. 


9 


438 


16 


Q8A8U0 


Q8a8u0 bacteroides 


780 


27 


65. 


9 


439 


4 


Q9BUG9 


Q9bug9 homo sapien 


781 


27 


65. 


9 


439 


10 


Q9C865 


Q9c865 arabidopsis 


782 


27 


65. 


9 


441 


16 


067719 


067719 aquifex aeo 


783 


27 


65. 


9 


441 


16 


Q89RS5 


Q89rs5 bradyrhizob 


784 


27 


65. 


9 


442 


11 


Q8C3Z6 


Q8c3z6 mus musculu 


785 


27 


65. 


9 


442 


11 


Q921V5 


Q921v5 mus musculu 


786 


27 


65. 


9 


444 


11 


Q9CV28


Q9cv28 mus musculu


787 


27 


65. 


9 


445 


4 


Q9H8M7 


Q9h8m7 homo sapien 


788 


27 


65. 


9 


445 


5 


Q9U2K7 


Q9u2k7 caenorhabdi 


789 


27 


65. 


9 


446 


6 


019071 


019071 s alpha-1,6 


790 


27 


65. 


9 


446 


16 


Q8ZFV5 


Q8zfv5 yersinia pe 


791 


27 


65. 


9 


446 


16 


Q83A43 


Q83a43 coxiella bu 


792 


27 


65. 


9 


447 


2 


Q93HR7 


Q93hr7 vibrio para 


793 


27 


65. 


9 


451 


10 


Q9FNU6 


Q9fnu6 populus x c 


794 


27 


65. 


9 


452 


16 


Q81V86 


Q81v86 bacillus an 


795 


27 


65. 


9 


454 


10 


Q9FUE7 


Q9fue7 populus del 


796 


27 


65. 


9 


454 


10 


Q9FUE9 


Q9fue9 populus tri 


797 


27 


65. 


9 


454 


10 


Q9FUE8 


Q9fue8 populus tri 


798 


27 


65. 


9 


455 


4 


Q96S13 


Q96sl3 homo sapien 



799 


27 


65. 


9 


455 


5 


Q8SXR5 


Q8sxr5 drosophila 


800 


27 


65. 


9 


455 


16 


Q8CZE5 


Q8cze5 oceanobacil 


801 


27 


65. 


9 


458 


16 


Q8DDA4 


Q8dda4 vibrio vuln 


802 


27 


65. 


9 


459 


16 


Q83AP6 


Q83ap6 coxiella bu 


803 


27 


65. 


9 


461 


10 


Q98RU1 


Q98rul guillardia 


804 


27 


65. 


9 


463 


10 


Q7X8X9 


Q7x8x9 oryza sativ 


805 


27


65. 


9 


468 


3 


Q9P8W6 


Q9p8w6 Candida alb 


806 


27 


65. 


9 


470 


2 


Q51847 


Q51847 prochloroth 


807 


27 


65. 


9 


470 


4 


Q9BZE6 


Q9bze6 homo sapien 


808 


27 


65. 


9 


470 


16 


Q8YFJ3 


Q8yfj3 brucella me 


809 


27 


65. 


9 


470 


16 


Q8G2B8 


Q8g2b8 brucella su 


810 


27 


65. 


9 


470 


17 


Q8TQM4 


Q8tqm4 methanosarc 


811 


27 


65. 


9 


473 


2 


Q9ZB35 


Q9zb35 hydrogenoph 


812 


27 


65. 


9 


473 


5 


Q9VN12 


Q9vnl2 drosophila 


813 


27 


65. 


9 


475 


8 


Q8WJ20


Q8wj20 ristantia g 


814 


27 


65. 


9 


476 


12 


Q8QNG3 


Q8qng3 ectocarpus 


815 


27 


65. 


9 


478 


16 


Q8YM51 


Q8ym51 anabaena sp 


816 


27 


65. 


9 


480 


17 


029235 


029235 archaeoglob 


817 


27 


65. 


9 


483 


12 


Q82172 


Q82172 leporid her 


818 


27 


65. 


9 


485 


5 


Q18544 


Q18544 caenorhabdi 


819 


27 


65. 


9 


488 


2 


030662 


030662 vibrio chol 


820 


27 


65. 


9 


488 


16 


Q9KQ66 


Q9kq66 vibrio chol 


821 


27 


65. 


9 


489 


16 


Q8XPS9 


Q8xps9 ralstonia s 


822 


27 


65. 


9 


493 


10 


Q7XKG0 


Q7xkg0 oryza sativ 


823 


27 


65. 


9 


501 


16 


Q7UZT9 


Q7uzt9 prochloroco 


824 


27 


65. 


9 


503 


2 


Q8GJK0 


Q8gjk0 meiothermus 



825 


27 


65. 


9 


503 


8 


Q8W797 


Q8w797 lysicarpus 


826 


27 


65. 


9 


503 


8 


Q9TKD2 


Q9tkd2 tristaniops 


827 


27 


65. 


9 


503 


8 


Q7YN83 


Q7yn83 diospyros v 


828 


27 


65. 


9 


505 


13 


Q8AWC7 


Q8awc7 gallus gall 


829 


27 


65. 


9 


505 


16 


069566 


069566 mycobacteri 


830 


27 


65. 


9 


506 


3 


Q9USN0 


Q9usn0 schizosacch 


831 


27 


65. 


9 


506 


16 


Q837F7 


Q837f7 enterococcu 


832 


27 


65. 


9 


511 


16 


Q8CXK8 


Q8cxk8 oceanobacil 


833 


27 


65. 


9 


513 


5 


Q9V8T4 


Q9v8t4 drosophila 


834 


27 


65.


9 


513 


5 


Q8IHB1 


Q8ihbl drosophila 


835 


27 


65. 


9 


514 


5 


076958 


076958 leishmania 


836 


27 


65. 


9 


521 


16 


Q92DS3 


Q92ds3 listeria in 


837 


27 


65. 


9 


522 


16 


Q8PKK8 


Q8pkk8 xanthomonas 


838 


27 


65. 


9 


523 


16 


Q9CLU9 


Q9clu9 pasteurella 


839 


27 


65. 


9 


530 


11 


Q8C2Z3 


Q8c2z3 mus musculu 


840 


27 


65. 


9 


532 


2 


Q8KQZ4 


Q8kqz4 vibrio chol 


841 


27 


65. 


9 


537 


4 


Q9HA80 


Q9ha80 homo sapien 


842 


27 


65. 


9 


541 


10 


Q851K4 


Q851k4 oryza sativ 


843 


27 


65. 


9 


542 


10 


Q9LYP8 


Q91yp8 arabidopsis 


844 


27 


65. 


9 


555 


2 


Q8KY42 


Q8ky42 streptomyce 


845 


27 


65. 


9 


555 


16 


Q8U8S0 


Q8u8s0 agrobacteri 


846 


27 


65. 


9 


557 


10 


Q9XIC5 


Q9xic5 arabidopsis 


847 


27 


65. 


9 


568 


5 


Q9NAL4 


Q9nal4 caenorhabdi 


848 


27 


65. 


9 


568 


16 


Q92HM4 


Q92hm4 rickettsia 


849 


27 


65. 


9 


569 


2 


Q93CY6 


Q93cy6 lactobacill 


850 


27 


65. 


9 


571 


16 


Q8XHF8 


Q8xhf8 Clostridium


851 


27 


65. 


9


576 


16 


O06137


O06137 mycobacteri 


852 


27 


65. 


9


576 


16 


Q7TZV7 


Q7tzv7 mycobacteri 


853 


27 


65. 


9


579 


10 


Q8S8Q2 


Q8s8q2 arabidopsis


854 


27 


65. 


9


580 


16 


Q9X2G1 


Q9x2gl thermotoga 


855 


27 


65. 


9


581 


16 


Q8A7H6 


Q8a7h6 bacteroides 



856 


27 


65. 


9 


582 


10 


023492 


023492 arabidopsis


857 


27 


65. 


9 


584 


16 


Q55449 


Q55449 synechocyst


858 


27 


65. 


9 


590 


10 


Q9M3B7 


Q9m3b7 arabidopsis 


859 


27 


65. 


9 


594 


12 


Q9PYU0 


Q9pyu0 xestia c-ni 


860


27 


65. 


9 


597 


10 


004659 


004659 arabidopsis


861 


27 


65. 


9 


600 


16 


Q8E3U4 


Q8e3u4 streptococc 


862 


27 


65. 


9 


600 


16 


Q8DY79 


Q8dy79 streptococc 


863 


27 


65. 


9 


602 


10 


Q8LHS7 


Q81hs7 oryza sativ 


864 


27 


65. 


9 


602 


16 


Q92HU2 


Q92hu2 rickettsia 


865 


27 


65. 


9 


604 


5 


Q9U1E2 


Q9ule2 leishmania 


866 


27 


65. 


9 


605 


12 


Q8QVE4 


Q8qve4 eyach virus 


867 


27 


65. 


9 


607 


5 


Q9VVC1


Q9vvc1 drosophila


868 


27 


65. 


9 


607 


10 


Q9FL94 


Q9fl94 arabidopsis 


869 


27 


65. 


9 


607 


10 


Q8L622 


Q81622 arabidopsis 


870 


27 


65. 


9 


612 


16 


Q9I0E3 


Q9i0e3 pseudomonas 


871 


27 


65. 


9 


612 


16 


Q8Y2U3 


Q8y2u3 ralstonia s 


872 


27 


65. 


9 


613 


16 


Q8P8R4 


Q8p8r4 xanthomonas 


873 


27 


65. 


9 


616 


5 


Q8I324 


Q81324 plasmodium 


874 


27 


65. 


9 


616 


10 


Q9LV10 


Q91vl0 arabidopsis 


875 


27 


65. 


9 


617 


2 


Q8RP27 


Q8rp27 actinomadur 


876 


27 


65. 


9 


625 


2 


Q93EB1 


Q93ebl rhizobium 1 


877 


27 


65. 


9 


625 


16 


Q8U8V6 


Q8u8v6 agrobacteri 


878 


27 


65. 


9 


627 


16 


Q89MD4 


Q89md4 bradyrhizob 


879 


27 


65. 


9 


628 


5 


Q8IPM4 


Q8ipm4 drosophila


880 


27 


65. 


9 


628 


16 


Q89EW3 


Q89ew3 bradyrhizob 


881 


27 


65. 


9 


631 


16 


Q9KE29


Q9ke29 bacillus ha 


882 


27 


65. 


9 


631 


16 


Q8EN36 


Q8en36 oceanobacil 


883 


27 


65. 


9 


631 


16 


Q81YT2 


Q81yt2 bacillus an 


884 


27 


65. 


9 


631 


16 


Q81I69 


Q81i69 bacillus ce 


885 


27 


65. 


9 


639 


16 


Q8D8E9 


Q8d8e9 vibrio vuln 


886 


27 


65. 


9 


640 


16 


Q92PE5 


Q92pe5 rhizobium m 


887 


27 


65. 


9 


641 


3 


Q870Z9 


Q870z9 neurospora 


888 


27 


65. 


9 


650 


17 


Q8TSE0 


Q8tse0 methanosarc 


889 


27 


65. 


9 


659 


2 


Q8RP47 


Q8rp47 azospirillu 


890 


27 


65. 


9 


660 


16 


Q930J2 


Q930j2 rhizobium m 


891 


27 


65. 


9 


660 


16 


Q88CC9 


Q88cc9 pseudomonas 


892 


27 


65. 


9 


661 


16 


Q92Y86 


Q92y86 rhizobium m 


893 


27 


65. 


9 


662 


5 


Q9VLK1 


Q9vlkl drosophila 


894 


27 


65. 


9 


667 


16 


Q7UZL0 


Q7uzl0 prochloroco 


895 


27 


65. 


9 


669 


17 


Q8PVI7 


Q8pvi7 methanosarc 


896 


27 


65. 


9 


694 


16 


Q97T45 


Q97t45 streptococc 


897 


27 


65. 


9 


702 


16 


Q8UD86 


Q8ud86 agrobacteri 


898 


27 


65. 


9 


702 


16 


Q8DRI8 


Q8dri8 streptococc 


899 


27 


65. 


9 


706 


5 


Q9VYR5 


Q9vyr5 drosophila 


900 


27 


65. 


9 


736 


5 


044857 


044857 caenorhabdi 


901 


27 


65. 


9 


743 


13 


Q9YGE7 


Q9yge7 oncorhynchu 


902 


27 


65. 


9 


749 


12 


Q7T4K3 


Q7t4k3 pea stem ne 


903 


27 


65. 


9 


750 


11 


Q80T69 


Q80t69 mus musculu 


904 


27 


65. 


9 


754 


4 


Q9NUP6 


Q9nup6 homo sapien 


905 


27 


65. 


9 


761 


2 


Q9F7S9 


Q9f7s9 gamma-prote 


906 


27 


65. 


9 


779 


4 


Q9H3P4 


Q9h3p4 homo sapien 


907 


27 


65. 


9 


780 


17 


Q9HJC5 


Q9hjc5 thermoplasm 


908 


27 


65. 


9 


782 


16 


Q8DBS6 


Q8dbs6 vibrio vuln 


909 


27 


65. 


9 


783 


16 


Q9A842 


Q9a842 caulobacter 


910 


27 


65. 


9 


784 


10 


Q9FVA3 


Q9fva3 zea mays (m 


911 


27 


65. 


9 


791 


16 


Q98RJ1 


Q98rjl mycoplasma 


912 


27 


65. 


9 


795 


11 


Q7TNJ3 


Q7tnj3 mus musculu 



913 


27 


65. 


.9 


826 


4 


Q8IY15 


Q8iyl5 homo sapien 


914 


27 


65. 


,9 


840 


2 


Q8KLI9 


Q8kli9 thermomonos 


915 


27 


65. 


9 


842 


16 


Q8PN87 


Q8pn87 xanthomonas 


916 


27 


65. 


9 


861 


16 


Q8PBN1 


Q8pbnl xanthomonas 


917 


27 


65. 


9 


865 


5 


P90740 


P90740 caenorhabdi 


918 


27 


65. 


9 


879 


2 


Q8KKY1 


Q8kkyl rhizobium e 


919 


27 


65. 


9 


879 


2 


030579 


030579 bacillus su 


920 


27 


65. 


9 


879 


16 


031504 


031504 bacillus su 


921 


27 


65. 


9 


884 


5 


Q8I7A1 


Q8i7al Plasmodium 


922 


27 


65. 


9 


895 


5 


Q86KM1 


Q86kml dictyosteli 


923 


27 


65. 


9 


895 


5 


Q7Z1B9 


Q7zlb9 Plasmodium 


924 


27 


65. 


9 


915 


16 


Q7VCW0 


Q7vcw0 prochloroco 


925 


27 


65. 


9 


923 


10 


Q9FTZ7 


Q9ftz7 oryza sativ 


926 


27 


65. 


9 


925 


5 


Q9U7E8 


Q9u7e8 caenorhabdi 


927 


27 


65. 


9 


930 


16 


Q8E6N1 


Q8e6nl streptococc 


928 


27 


65. 


9 


930 


16 


Q8E177 


Q8el77 streptococc 


929 


27 


65. 


9 


930 


16 


Q8DVD3 


Q8dvd3 streptococc 


930 


27 


65. 


9 


930 


16 


Q8DNW4 


Q8dnw4 streptococc 


931 


27 


65. 


9 


932 


16 


Q8 8V8 9 


Q88v89 lactobacill 


932 


27 


65. 


9 


933 


16 


Q99YW3 


Q99yw3 streptococc 


933 


27 


65. 


9 


933 


16 


Q8P068 


Q8p068 streptococc 


934 


27 


65. 


9 


933 


16 


Q8K6S2 


Q8k6s2 streptococc 


935 


27 


65. 


9 


949 


3 


014093 


014093 schizosacch 


936 


27 


65. 


9 


951 


13 


Q7ZUF4 


Q7zuf4 brachydanio 


937 


27 


65. 


9 


954 


4 


Q8WXV8 


Q8wxv8 homo sapien 


938 


27 


65. 


9 


957 


4 


Q9H0V3 


Q9h0v3 homo sapien 


939 


27 


65. 


9 


957 


4 


Q96P44 


Q96p44 homo sapien 


940 


27 


65. 


9 


971 


5 


Q95Q20 


Q95q20 caenorhabdi 


941 


27 


65. 


9 


996 


16 


Q97NR7 


Q97nr7 streptococc 


942 


27 


65. 


9 


996 


16 


Q8DNF6 


Q8dnf6 streptococc 


943 


27 


65. 


9 


1014 


10 


Q84KC5 


Q84kc5 hordeum vul 


944 


27 


65. 


9 


1021 


5 


015733 


015733 dictyosteli 


945 


27 


65. 


9 


1024 


11 


Q9ESJ5 


Q9esj5 mus musculu 


946 


27 


65. 


9 


1025 


16 


Q830R3 


Q830r3 enterococcu 


947 


27 


65. 


9 


1030 


16 


Q8DSP2 


Q8dsp2 streptococc 


948 


27 


65. 


9 


1056 


10 


Q7XV85 


Q7xv85 oryza sativ 


949 


27 


65. 


9 


1059 


10 


Q8LI36 


Q81i36 oryza sativ 


950 


27 


65. 


9 


1061 


13 


Q9I9K0 


Q9i9k0 xenopus lae 


951 


27 


65. 


9 


1074 


16 


Q88TG3 


Q88tg3 lactobacill 


952 


27 


65. 


9 


1079 


12 


Q9YW56 


Q9yw56 melanoplus 


953 


27 


65. 


9 


1105 


5 


Q9VX31 


Q9vx31 drosophila 


954 


27 


65. 


9 


1106 


5 


Q8MMD3 


Q8mmd3 drosophila 


955 


27 


65. 


9 


1117 


5 


Q9VK50 


Q9vk50 drosophila 


956 


27 


65. 


9 


1121 


5 


Q9NJ94 


Q9nj 94 drosophila 


957 


27 


65. 


9 


1124 


16 


Q9CM7 6 


Q9cm7 6 pasteurella 


958 


27 


65. 


9 


1153 


16 


Q92JC1 


Q92 j cl rickettsia 


959 


27 


65. 


9 


1155 


16 


Q9ZE40 


Q9ze4 0 rickettsia 


960 


27 


65. 


9 


1165 


5 


Q962L9 


Q96219 Plasmodium 


961 


27 


65. 


9 


1168 


10 


Q9LE81 


Q91e8 1 arabidopsis 


962 


27 


65. 


9 


1182 


11 


Q8C6K9 


Q8c6k9 mus musculu 


963 


27 


65. 


9 


1196 


10 


Q9SG02 


Q9sg02 arabidopsis 


964 


27 


65. 


9 


1196 


10 


Q9LKP0 


Q91kp0 arabidopsis 


965 


27 


65. 


9 


1218 


10 


Q8LHH9 


Q81hh9 oryza sativ 


966 


27 


65. 


9 


1225 


2 


Q8RJI1 


Q8r jil providencia 


967 


27 


65. 


9 


1225 


2 


Q849Z9 


Q84 9z9 escherichia 


968 


27 


65. 


9 


1232 


5 


Q8MMD2 


Q8mmd2 drosophila 


969 


27 


65. 


9 


1253 


5 . 


Q8WQ61 


Q8wq61 drosophila 



970 


27 


65. 


9 


1253 


5 


Q9W111 


Q9wlll drosophila 


971 


27 


65. 


9 


1253 


5 


Q8IKA0 


Q8ika0 plasmodium 


972 


27 


65. 


9 


1293 


10 


Q9M8Z0 


Q9m8z0 arabidopsis 


973 


27 


65. 


9 


1309 


10 


Q9SXF0 


Q9sxf0 arabidopsis 


974 


27 


65. 


9 


1321 


10 


064418 


064418 arabidopsis 


975 


27 


65. 


9 


1431 


5 


Q9NG75 


Q9ng75 parartemia 


976 


27 


65. 


9 


1457 


10 


Q8LPU6 


Q81pu6 zea mays (m 


977 


27 


65. 


9 


1466 


10 


Q94IU9 


Q94iu9 arabidopsis 


978 


27 


65. 


9 


1470 


11 


Q8CHD4 


Q8chd4 mus musculu 


979 


27 


65. 


9 


1470 


12 


Q8 JYK1 


Q8jykl fiji diseas 


980 


27 


65. 


9 


1488 


4 


Q9HCM3 


Q9hcm3 homo sapien 


981 


27 


65. 


9 


1525 


10 


065343 


065343 zea mays (m 


982 


27 


65. 


9 


1557 


11 


Q811F2 


Q811f2 mus musculu 


983 


27 


65. 


9 


1620 


5 


045000 


04 5 0 0 0 caenorhabdi 


984 


27 


65. 


9 


1725 


6 


Q867A1 


Q867al canis famil 


985 


27 


65. 


9 


1725 


11 


P70570 


P70570 rattus norv 


986 


27 


65. 


9 


1743 


4 


Q9BYP7 


Q9byp7 homo sapien 


987 


27 


65. 


9 


1800 


4 


Q8TCX6 


Q8tcx6 homo sapien 


988 


27 


65. 


9 


1806 


4 


Q96TG0 


Q96tg0 homo sapien 


989 


27 


65. 


9 


1820 


12 


Q90153 


Q90153 ustilago ma 


990 


27 


65. 


9 


1840 


11 


Q60467 


Q60467 cricetulus 


991 


27 


65. 


9 


1998 


12 


056050 


056050 rhopalosiph 


992 


27 


65. 


9 


2181 


5 


Q8IFQ4 


Q8ifq4 plasmodium 


993 


27 


65. 


9 


2201 


5 


Q8IFQ5 


Q8ifq5 Plasmodium 


994 


27 


65. 


9 


2219 


4 


Q9C0A3 


Q9c0a3 homo sapien 


995 


27 


65. 


9 


2256 


5 


Q8I5L5 


Q8i515 plasmodium 


996 


27 


65. 


9 


2267 


5 


Q8IBX1 


Q8ibxl plasmodium 


997 


27 


65. 


9 


2270 


5 


Q8I3E5 


Q8i3e5 plasmodium 


998 


27 


65. 


9 


2296 


4 


Q9Y3S1 


Q9y3sl homo sapien 


999 


27 


65. 


9 


2613 


5 


Q9GYD1 


Q9gydl leishmania 


1000 


27 


65. 


9 


3013 


12 


092530 


092530 hepatitis c 



ALIGNMENTS 



RESULT 1 
Q9UCD1 

ID Q9UCD1 PRELIMINARY; PRT; 28 AA. 

AC Q9UCD1; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Beta-amyloid peptide (Fragment).

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1]

RP SEQUENCE.

RX MEDLINE=94045685; PubMed=8229004;

RA Vigo-Pelfrey C., Lee D., Keim P., Lieberburg I., Schenk D.B.;

RT "Characterization of beta-amyloid peptide from human cerebrospinal

RT fluid.";

RL J. Neurochem. 61:1965-1968(1993).

DR HSSP; P05067; 1AMB.

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR001255; Beta-APP.

DR Pfam; PF03494; Beta-APP; 1.

SQ SEQUENCE 28 AA; 3244 MW; DE7BD081160AFC81 CRC64;

Query Match 85.4%; Score 35; DB 4; Length 28;

Best Local Similarity 100.0%; Pred. No. 3.4;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy         1 LVFFAED 7
             |||||||
Db        17 LVFFAED 23



RESULT 2 
Q9UCA9 

ID Q9UCA9 PRELIMINARY; PRT; 30 AA. 

AC Q9UCA9; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Beta-amyloid protein (Fragment).

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE. 

RX MEDLINE=94153015; PubMed=8109908;

RA Wisniewski T., Lalowski M., Levy E., Marques M.R., Frangione B.;

RT "The amino acid sequence of neuritic plaque amyloid from a familial 

RT Alzheimer's disease patient."; 

RL Ann. Neurol. 35:245-246(1994). 

DR HSSP; P05067; 1BA4.

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR001255; Beta-APP.

DR Pfam; PF03494; Beta-APP; 1.

SQ SEQUENCE 30 AA; 3391 MW; FF4167ABD081160A CRC64;

Query Match 85.4%; Score 35; DB 4; Length 30;

Best Local Similarity 100.0%; Pred. No. 3.7;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy         1 LVFFAED 7
             |||||||
Db        17 LVFFAED 23



RESULT 3 
Q9UC33 

ID Q9UC33 PRELIMINARY; PRT; 33 AA. 

AC Q9UC33; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Beta-amyloid peptide (Fragment).

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 



OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE. 

RX MEDLINE=93024877; PubMed=1406936;

RA Seubert P., Vigo-Pelfrey C., Esch F., Lee M., Dovey H., Davis D.,

RA Sinha S., Schlossmacher M., Whaley J., Swindlehurst C.;

RT "Isolation and quantification of soluble Alzheimer's beta-peptide from 

RT biological fluids."; 

RL Nature 359:325-327(1992). 

DR HSSP; P05067; 1BA4.

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR001255; Beta-APP.

DR Pfam; PF03494; Beta-APP; 1.

SQ SEQUENCE 33 AA; 3674 MW; B1DEFE2F4167ABD0 CRC64;

Query Match 85.4%; Score 35; DB 4; Length 33; 

Best Local Similarity 100.0%; Pred. No. 4; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy         1 LVFFAED 7
             |||||||
Db        17 LVFFAED 23



RESULT 4 
O35463

ID O35463 PRELIMINARY; PRT; 79 AA.

AC O35463;

DT 01-JAN-1998 (TrEMBLrel. 05, Created)

DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE Alzheimer's amyloid beta protein (Fragment).

GN BETA APP.

OS Cricetulus griseus (Chinese hamster).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Cricetinae;

OC Cricetulus.

OX NCBI_TaxID=10029;

RN [1]

RP SEQUENCE FROM N.A.

RA Sambamurti K., Pinnix I., Gandhi S.;

RL Submitted (OCT-1997) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF030413; AAB86608.1;

DR HSSP; P05067; 1BA4.

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR001255; Beta-APP.

DR Pfam; PF03494; Beta-APP; 1.

FT NON_TER 1 1

FT NON_TER 79 79

SQ SEQUENCE 79 AA; 8538 MW; 37F2C6C3BFF3F597 CRC64;

Query Match 85.4%; Score 35; DB 11; Length 79;

Best Local Similarity 100.0%; Pred. No. 9.8;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy         1 LVFFAED 7
             |||||||
Db        37 LVFFAED 43



RESULT 5 
Q16020 

ID Q16020 PRELIMINARY; PRT; 82 AA. 

AC Q16020; 

DT 01-NOV-1996 (TrEMBLrel. 01, Created) 

DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Beta-amyloid peptide (Fragment).

GN BETA APP.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=93236601; PubMed=8476439;

RA Denman R.B., Rosenzcwaig R., Miller D.L.;

RT "A system for studying the effect(s) of familial Alzheimer disease

RT mutations on the processing of the beta-amyloid peptide precursor."

RL Biochem. Biophys. Res. Commun. 192:96-103(1993).

DR EMBL; S61383; AAB26265.2;

DR HSSP; P05067; 1BA4.

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR001255; Beta-APP.

DR Pfam; PF03494; Beta-APP; 1.

FT NON_TER 1 1

FT NON_TER 82 82

SQ SEQUENCE 82 AA; 8882 MW; F534AA5AE5D9230A CRC64;



Query Match 85.4%; Score 35; DB 4; Length 82; 

Best Local Similarity 100.0%; Pred. No. 10; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy         1 LVFFAED 7
             |||||||
Db        34 LVFFAED 40



RESULT 6 
Q16014 

ID Q16014 PRELIMINARY; PRT; 82 AA. 

AC Q16014; 

DT 01-NOV-1996 (TrEMBLrel. 01, Created) 

DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Beta-amyloid peptide (Fragment). 

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=93236601; PubMed=8476439;

RA Denman R.B., Rosenzcwaig R., Miller D.L.;

RT "A system for studying the effect(s) of familial Alzheimer disease

RT mutations on the processing of the beta-amyloid peptide precursor."

RL Biochem. Biophys. Res. Commun. 192:96-103(1993).

DR EMBL; S60721; AAB26263.2; 

DR HSSP; P05067; 1BA4. 

DR GO; GO: 0016020; C:membrane; IEA. 

DR InterPro; IPR001255; Beta-APP. 

DR Pfam; PF03494; Beta-APP; 1.

FT NON_TER 1 1

FT NON_TER 82 82

SQ SEQUENCE 82 AA; 8972 MW; F534AA5B3EA9230A CRC64;

Query Match 85.4%; Score 35; DB 4; Length 82;

Best Local Similarity 100.0%; Pred. No. 10;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy         1 LVFFAED 7
             |||||||
Db        34 LVFFAED 40



RESULT 7 
Q16019 

ID Q16019 PRELIMINARY; PRT; 82 AA. 

AC Q16019; 

DT 01-NOV-1996 (TrEMBLrel. 01, Created)

DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE Beta-amyloid peptide (Fragment).

GN BETA APP.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=93236601; PubMed=8476439;

RA Denman R.B., Rosenzcwaig R., Miller D.L.;

RT "A system for studying the effect(s) of familial Alzheimer disease

RT mutations on the processing of the beta-amyloid peptide precursor."

RL Biochem. Biophys. Res. Commun. 192:96-103(1993).

DR EMBL; S61380; AAB26264.2; -.

DR HSSP; P05067; 1BA4.

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR001255; Beta-APP.

DR Pfam; PF03494; Beta-APP; 1.

FT NON_TER 1 1

FT NON_TER 82 82

SQ SEQUENCE 82 AA; 8938 MW; F534AA50E579230A CRC64;

Query Match 85.4%; Score 35; DB 4; Length 82;

Best Local Similarity 100.0%; Pred. No. 10;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy         1 LVFFAED 7
             |||||||
Db        34 LVFFAED 40



RESULT 8 
Q8JH58 

ID Q8JH58 PRELIMINARY; PRT; 113 AA. 

AC Q8JH58; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Amyloid beta protein (Fragment).

OS Chelydra serpentina serpentina (common snapping turtle).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Testudines; Cryptodira; Testudinoidea; Chelydridae; Chelydra.

OX NCBI_TaxID=134619;

RN [1]

RP SEQUENCE FROM N.A.

RX MEDLINE=21876906; PubMed=11882478;

RA Trudeau V.L., Chiu S., Kennedy S.W., Brooks R.J.; 

RT "Octylphenol (OP) alters the expression of members of the amyloid 

RT protein family in the hypothalamus of the snapping turtle, Chelydra 

RT serpentina serpentina."; 

RL Environ. Health Perspect. 110:269-275(2002). 

DR EMBL; AF541917; AAN04908.1; -. 

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR001255; Beta-APP. 

DR Pfam; PF03494; Beta-APP; 1. 

DR PRINTS; PR00203; AMYLOIDA4. 

DR PROSITE; PS00320; A4_INTRA; 1. 

FT NON_TER 1 1 

SQ SEQUENCE 113 AA; 12750 MW; 72515C930496E053 CRC64;

Query Match 85.4%; Score 35; DB 13; Length 113;

Best Local Similarity 100.0%; Pred. No. 14;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy         1 LVFFAED 7
             |||||||
Db        31 LVFFAED 37



RESULT 9 
Q8BPV5 

ID Q8BPV5 PRELIMINARY; PRT; 218 AA. 

AC Q8BPV5; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Amyloid beta (Fragment).

GN APP.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=C57BL/6J; TISSUE=Lung;

RX MEDLINE=22354683; PubMed=12466851;

RA The FANTOM Consortium,

RA the RIKEN Genome Exploration Research Group Phase I & II Team;

RT "Analysis of the mouse transcriptome based on functional annotation of

RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002).

DR EMBL; AK052448; BAC34997.1; -.

DR MGD; MGI:88059; App.

DR GO; GO:0005515; F:protein binding; IPI.

DR InterPro; IPR008155; A4_APP.

DR InterPro; IPR001255; Beta-APP.

DR Pfam; PF03494; Beta-APP; 1.

DR PRINTS; PR00203; AMYLOIDA4.

DR PROSITE; PS00320; A4_INTRA; 1.

FT NON_TER 1 1

SQ SEQUENCE 218 AA; 24118 MW; 95B55AFDAE1D0EF5 CRC64;

Query Match 85.4%; Score 35; DB 11; Length 218;

Best Local Similarity 100.0%; Pred. No. 28;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy         1 LVFFAED 7
             |||||||
Db       136 LVFFAED 142



RESULT 10 
Q8E547 

ID Q8E547 PRELIMINARY; PRT; 295 AA. 

AC Q8E547; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

GN GBS1185. 

OS Streptococcus agalactiae (serotype III). 

OC Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;

OC Streptococcus.

OX NCBI_TaxID=216495;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=NEM316 / Serotype III;

RX MEDLINE=22242508; PubMed=12354221;

RA Glaser P., Rusniok C., Buchrieser C., Chevalier F., Frangeul L.,

RA Msadek T., Zouine M., Couve E., Lalioui L., Poyart C., Trieu-Cuot P.,

RA Kunst F.;

RT "Genome sequence of Streptococcus agalactiae, a pathogen causing

RT invasive neonatal disease.";

RL Mol. Microbiol. 45:1499-1513(2002).

DR EMBL; AL766849; CAD46844.1; -.

DR SagaList; gbs1185; -.

DR GO; GO:0005576; C:extracellular; IEA.

DR GO; GO:0016020; C:membrane; IEA.

DR GO; GO:0005179; F:hormone activity; IEA.

DR InterPro; IPR000187; corticoliberin.

DR InterPro; IPR000620; DUF6.

DR InterPro; IPR004626; RarD.

DR Pfam; PF00892; DUF6; 1.

DR TIGRFAMs; TIGR00688; rarD; 1.

DR PROSITE; PS00511; CRF; 1.

KW Hypothetical protein; Complete proteome.

SQ SEQUENCE 295 AA; 33015 MW; 60DDE324099DD314 CRC64;

Query Match 85.4%; Score 35; DB 16; Length 295;

Best Local Similarity 75.0%; Pred. No. 38;

Matches 6; Conservative 2; Mismatches 0; Indels 0; Gaps 0;

Qy         1 LVFFAEDF 8
             :||||:||
Db       196 IVFFAKDF 203



RESULT 11
Q8DZI3

ID Q8DZI3 PRELIMINARY; PRT; 295 AA.

AC Q8DZI3;

DT 01-MAR-2003 (TrEMBLrel. 23, Created)

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)

DE RarD protein.

GN RARD OR SAG1118.

OS Streptococcus agalactiae (serotype V).

OC Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;

OC Streptococcus.

OX NCBI_TaxID=216466;

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=2603 V/R / Serotype V;

RX MEDLINE=22222988; PubMed=12200547;

RA Tettelin H., Masignani V., Cieslewicz M.J., Eisen J.A., Peterson S.,

RA Wessels M.R., Paulsen I.T., Nelson K.E., Margarit I., Read T.D.,

RA Madoff L.C., Wolf A.M., Beanan M.J., Brinkac L.M., Daugherty S.C.,

RA DeBoy R.T., Durkin A.S., Kolonay J.F., Madupu R., Lewis M.R.,

RA Radune D., Fedorova N.B., Scanlan D., Khouri H., Mulligan S.,

RA Carty H.A., Cline R.T., Van Aken S.E., Gill J., Scarselli M., Mora M.,

RA Iacobini E.T., Brettoni C., Galli G., Mariani M., Vegni F., Maione D.,

RA Rinaudo D., Rappuoli R., Telford J.L., Kasper D.L., Grandi G.,

RA Fraser C.M.;

RT "Complete genome sequence and comparative genomic analysis of an

RT emerging human pathogen, serotype V Streptococcus agalactiae.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:12391-12396(2002).

DR EMBL; AE014243; AAM99999.1; -.

DR TIGR; SAG1118; -.

DR GO; GO:0005576; C:extracellular; IEA.

DR GO; GO:0016020; C:membrane; IEA.

DR GO; GO:0005179; F:hormone activity; IEA.

DR InterPro; IPR000187; corticoliberin.

DR InterPro; IPR000620; DUF6.

DR InterPro; IPR004626; RarD.

DR Pfam; PF00892; DUF6; 1.

DR TIGRFAMs; TIGR00688; rarD; 1.

DR PROSITE; PS00511; CRF; 1.

KW Complete proteome.

SQ SEQUENCE 295 AA; 33015 MW; 60DDE324099DD314 CRC64;

Query Match 85.4%; Score 35; DB 16; Length 295;

Best Local Similarity 75.0%; Pred. No. 38;

Matches 6; Conservative 2; Mismatches 0; Indels 0; Gaps 0;

Qy         1 LVFFAEDF 8
             :||||:||
Db       196 IVFFAKDF 203



RESULT 12
Q8UUI8

ID Q8UUI8 PRELIMINARY; PRT; 357 AA.

AC Q8UUI8;

DT 01-MAR-2002 (TrEMBLrel. 20, Created)

DT 01-MAR-2002 (TrEMBLrel. 20, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE Putative membrane protein (Fragment).

GN APPA.

OS Brachydanio rerio (Zebrafish) (Danio rerio).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;

OC Cyprinidae; Danio.

OX NCBI_TaxID=7955;

RN [1]

RP SEQUENCE FROM N.A.

RC TISSUE=Embryo;

RX PubMed=11862463;

RA Musa A., Lehrach H., Russo V.E.A.;

RT "Distinct expression patterns of two zebrafish homologues of the human

RT APP gene during embryonic development.";

RL Dev. Genes Evol. 211:563-567(2001).

DR EMBL; AJ315637; CAC85734.1; -.

DR ZFIN; ZDB-GENE-000616-13; appa.

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR008155; A4_APP.

DR InterPro; IPR001255; Beta-APP.

DR Pfam; PF03494; Beta-APP; 1.

DR PRINTS; PR00203; AMYLOIDA4.

DR PROSITE; PS00320; A4_INTRA; 1.

FT NON_TER 1 1

SQ SEQUENCE 357 AA; 40962 MW; 07D99EEF6C55B2D8 CRC64;

Query Match 85.4%; Score 35; DB 13; Length 357;

Best Local Similarity 100.0%; Pred. No. 46;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy         1 LVFFAED 7
             |||||||
Db       275 LVFFAED 281



RESULT 13 
O20025

ID O20025 PRELIMINARY; PRT; 361 AA.

AC O20025;

DT 01-JAN-1998 (TrEMBLrel. 05, Created)

DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE Intron maturase (Maturase K) (Fragment).

GN MATK.

OS Crithmum maritimum (samphire).

OG Chloroplast.

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;

OC Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; asterids;

OC campanulids; Apiales; Apiaceae; Apioideae; apioid superclade;

OC Pyramidoptereae; Crithmum.

OX NCBI_TaxID=40916;

RN [1]

RP SEQUENCE FROM N.A.

RA Plunkett G.M., Soltis D.E., Soltis P.S.;

RT "Evolutionary patterns in Apiaceae: inferences based on matK sequence

RT data.";

RL Syst. Bot. 21:477-495(1996).

CC -!- FUNCTION: PROBABLY ASSISTS IN SPLICING CHLOROPLAST GROUP II

CC INTRONS (BY SIMILARITY).

CC -!- SIMILARITY: WITH CORRESPONDING ORF IN OTHER PLANT CHLOROPLASTS,

CC AND REGIONS OF SIMILARITY TO MATURASE-LIKE POLYPEPTIDES ENCODED BY

CC MITOCHONDRIAL INTRONS.

DR EMBL; U58558; AAB66262.1; -.

DR GO; GO:0009507; C:chloroplast; IEA.

DR GO; GO:0006397; P:mRNA processing; IEA.

DR InterPro; IPR002866; MatK_N.

DR Pfam; PF01824; MatK_N; 1.

KW mRNA processing; Chloroplast.

FT NON_TER 361 361

SQ SEQUENCE 361 AA; 42847 MW; 43A0657ED3134DEA CRC64;

Query Match 85.4%; Score 35; DB 8; Length 361;

Best Local Similarity 75.0%; Pred. No. 46;

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy         1 LVFFAEDF 8
             |:||| ||
Db        73 LIFFANDF 80



RESULT 14 
O20011

ID O20011 PRELIMINARY; PRT; 361 AA.

AC O20011;

DT 01-JAN-1998 (TrEMBLrel. 05, Created) 

DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Intron maturase (Maturase K) (Fragment).

GN MATK. 

OS Anthriscus sylvestris. 

OG Chloroplast. 

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 

OC Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; asterids; 

OC campanulids; Apiales; Apiaceae; Apioideae; Scandiceae; Scandicinae; 

OC Anthriscus. 

OX NCBI_TaxID=48027;



RN [1] 

RP SEQUENCE FROM N.A. 

RA Plunkett G.M., Soltis D.E., Soltis P.S.; 

RT "Evolutionary patterns in Apiaceae: inferences based on matK sequence 

RT data."; 

RL Syst. Bot. 21:477-495(1996). 

CC -!- FUNCTION: PROBABLY ASSISTS IN SPLICING CHLOROPLAST GROUP II

CC INTRONS (BY SIMILARITY).

CC -!- SIMILARITY: WITH CORRESPONDING ORF IN OTHER PLANT CHLOROPLASTS,

CC AND REGIONS OF SIMILARITY TO MATURASE-LIKE POLYPEPTIDES ENCODED BY

CC MITOCHONDRIAL INTRONS.

DR EMBL; U58547; AAB66255.1; -.

DR GO; GO:0009507; C:chloroplast; IEA.

DR GO; GO:0006397; P:mRNA processing; IEA. 

DR InterPro; IPR002866; MatK_N. 

DR Pfam; PF01824; MatK_N; 1. 

KW mRNA processing; Chloroplast. 

FT NON_TER 361 361 

SQ SEQUENCE 361 AA; 43334 MW; D1A875A9910B6F21 CRC64;

Query Match 85.4%; Score 35; DB 8; Length 361;

Best Local Similarity 75.0%; Pred. No. 46;

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy         1 LVFFAEDF 8
             |:||| ||
Db        73 LIFFANDF 80



RESULT 15 
O20068

ID O20068 PRELIMINARY; PRT; 364 AA.

AC O20068;

DT 01-JAN-1998 (TrEMBLrel. 05, Created)

DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE Intron maturase (Maturase K) (Fragment).

GN MATK. 

OS Neogoezia minor. 

OG Chloroplast. 

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 

OC Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; asterids;

OC campanulids; Apiales; Apiaceae; Apioideae; Oenantheae; Neogoezia. 

OX NCBI_TaxID=46372; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Plunkett G.M., Soltis D.E., Soltis P.S.; 

RT "Evolutionary patterns in Apiaceae: inferences based on matK sequence 

RT data.";

RL Syst. Bot. 21:477-495(1996).

RN [2] 

RP SEQUENCE FROM N.A. 

RA Plunkett G.M., Soltis D.E., Soltis P.S.; 

RT "Clarification of the relationship between Apiaceae and Araliaceae 

RT based on matK and rbcL sequence data."; 

RL Am. J. Bot. 84:565-580(1997). 

CC -!- FUNCTION: PROBABLY ASSISTS IN SPLICING CHLOROPLAST GROUP II

CC INTRONS (BY SIMILARITY).

CC -!- SIMILARITY: WITH CORRESPONDING ORF IN OTHER PLANT CHLOROPLASTS,

CC AND REGIONS OF SIMILARITY TO MATURASE-LIKE POLYPEPTIDES ENCODED BY

CC MITOCHONDRIAL INTRONS.

DR EMBL; U58570; AAB66281.1; -.

DR GO; GO:0009507; C:chloroplast; IEA.

DR GO; GO:0006397; P:mRNA processing; IEA.

DR InterPro; IPR002866; MatK_N.

DR Pfam; PF01824; MatK_N; 1. 

KW mRNA processing; Chloroplast. 

FT NON_TER 364 364 

SQ SEQUENCE 364 AA; 42939 MW; DBABC9499ED36646 CRC64; 

Query Match 85.4%; Score 35; DB 8; Length 364; 

Best Local Similarity 75.0%; Pred. No. 47; 

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy         1 LVFFAEDF 8
             |:||| ||
Db        76 LIFFANDF 83



RESULT 16 
Q8BPC7 

ID Q8BPC7 PRELIMINARY; PRT; 384 AA. 

AC Q8BPC7;

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Amyloid beta (Fragment).

GN APP.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Head; 

RX MEDLINE=22354683; PubMed=12466851;

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002).

DR EMBL; AK076506; BAC36369.1;

DR MGD; MGI:88059; App.

DR GO; GO:0005515; F:protein binding; IPI.

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR001255; Beta-APP. 

DR Pfam; PF03494; Beta-APP; 1. 

DR PRINTS; PR00203; AMYLOIDA4.

DR PROSITE; PS00320; A4_INTRA; 1.

FT NON_TER 1 1

SQ SEQUENCE 384 AA; 43990 MW; A81B1AD8AE683173 CRC64;



Query Match 85.4%; Score 35; DB 11; Length 384; 

Best Local Similarity 100.0%; Pred. No. 49; 



Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy         1 LVFFAED 7
             |||||||
Db       302 LVFFAED 308



RESULT 17 
Q8UUS0 



ID Q8UUS0 PRELIMINARY; PRT; 472 AA.

AC Q8UUS0;

DT 01-MAR-2002 (TrEMBLrel. 20, Created)

DT 01-MAR-2002 (TrEMBLrel. 20, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE Putative membrane protein (Fragment).

GN APPA.

OS Brachydanio rerio (Zebrafish) (Danio rerio).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;

OC Cyprinidae; Danio.

OX NCBI_TaxID=7955;

RN [1]

RP SEQUENCE FROM N.A.

RC TISSUE=Brain;

RX PubMed=11862463;

RA Musa A., Lehrach H., Russo V.E.A.;

RT "Distinct expression patterns of two zebrafish homologues of the human

RT APP gene during embryonic development.";

RL Dev. Genes Evol. 211:563-567(2001).

DR EMBL; AJ315636; CAC85733.1;

DR ZFIN; ZDB-GENE-000616-13; appa.

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR008155; A4_APP.

DR InterPro; IPR001255; Beta-APP.

DR Pfam; PF03494; Beta-APP; 1.

DR PRINTS; PR00203; AMYLOIDA4.

DR PROSITE; PS00320; A4_INTRA; 1.

FT NON_TER 1 1

SQ SEQUENCE 472 AA; 53787 MW; 24F7128BE3356550 CRC64;

Query Match 85.4%; Score 35; DB 13; Length 472;

Best Local Similarity 100.0%; Pred. No. 61;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy         1 LVFFAED 7
             |||||||
Db       390 LVFFAED 396



RESULT 
093296 



18 



PRELIMINARY; 



ID 093296 

AC 093296; 

DT 01-NOV-1998 (TrEMBLrel. 08, 

DT 01-NOV-1998 (TrEMBLrel. 08, 

DT 01-JUN-2003 (TrEMBLrel. 24, 

DE Amyloid protein (Fragment) . 



PRT; 



534 AA. 



Created) 

Last sequence update) 
Last annotation update) 



OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostorni; 

OC Archosauria; Aves; Neognathae; Gallif ormes ; Phasianiciae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98337885; PubMed=9671674 ; 

RA Barnes N.Y., Li L., Yoshikawa K. , Schwartz L.M., Oppenheim R.W., 

RA Milligan C.E. ; 

RT "Increased production of amyloid precursor protein provides a 

RT substrate for caspase-3 in dying motoneurons."; 

RL J. Neurosci. 18:5869-5880(1998). 

DR EMBL; AF042098; AAC25052.1; -. 

DR HSSP; P05067; 1BA4 . 

DR GO; GO: 0016020; C:membrane; IEA. 

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR008154; A4_extra. 

DR InterPro; IPR001255; Beta-APP. 

DR Pfam; PF02177; A4_EXTRA; 1. 

DR Pfam; PF03494; Beta-APP; 1. 

DR PRINTS; PR00203; AMYLOIDA4.

DR PROSITE; PS00319; A4_EXTRA; 1.

DR PROSITE; PS00320; A4_INTRA; 1.

FT NON_TER 1 1

SQ SEQUENCE 534 AA; 60597 MW; FB53ECC2E66D4C92 CRC64; 



Query Match 85.4%; Score 35; DB 13; Length 534; 

Best Local Similarity 100.0%; Pred. No. 69; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy   1 LVFFAED 7
       |||||||
Db 452 LVFFAED 458



RESULT 19 
Q9PVL1 

ID Q9PVL1 PRELIMINARY; PRT; 569 AA. 

AC Q9PVL1; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Amyloid protein (Fragment) . 

GN APP. 

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;

OC Gallus.

OX NCBI_TaxID=9031;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain;

RA Coulson E.J., Paliga K., Beyreuther K., Masters C.L.;

RT "What the evolution of the amyloid protein precursor supergene family 

RT tells us about its function."; 

RL Neurochem. Int. 0:0-0(2000). 



DR EMBL; AF030341; AAF12698.1; -. 

DR HSSP; P05067; 1BA4.

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR008154; A4_extra. 

DR InterPro; IPR001255; Beta-APP. 

DR Pfam; PF02177; A4_EXTRA; 1. 

DR Pfam; PF03494; Beta-APP; 1. 

DR PRINTS; PR00203; AMYLOIDA4.

DR PROSITE; PS00319; A4_EXTRA; 1. 

DR PROSITE; PS00320; A4_INTRA; 1. 

FT NON_TER 1 1 

SQ SEQUENCE 569 AA; 64753 MW; 0AB8BB851863A19D CRC64;



Query Match 85.4%; Score 35; DB 13; Length 569; 

Best Local Similarity 100.0%; Pred. No. 74; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAED 7
       |||||||
Db 488 LVFFAED 494



RESULT 20 
Q9I9E7 

ID Q9I9E7 PRELIMINARY; PRT; 612 AA. 

AC Q9I9E7; 

DT 01-OCT-2000 (TrEMBLrel. 15, Created) 

DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Amyloid protein (Fragment) . 

GN APPA. 

OS Brachydanio rerio (Zebrafish) (Danio rerio).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;

OC Cyprinidae; Danio.

OX NCBI_TaxID=7955;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Slavov D.B., Gardiner K.;

RT "An App cDNA from Zebrafish (Danio rerio).";

RL Submitted (APR-2000) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF257742; AAF71748.1; -. 

DR HSSP; P05067; 1HZ3. 

DR ZFIN; ZDB-GENE-000616-13; appa.

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR008154; A4_extra. 

DR InterPro; IPR001255; Beta-APP. 

DR Pfam; PF02177; A4_EXTRA; 1. 

DR Pfam; PF03494; Beta-APP; 1. 

DR PRINTS; PR00203; AMYLOIDA4. 

DR PROSITE; PS00319; A4_EXTRA; 1. 

DR PROSITE; PS00320; A4_INTRA; 1.

FT NON_TER 1 1

SQ SEQUENCE 612 AA; 69710 MW; 59A9ACBDF9C59EFF CRC64; 



Query Match 85.4%; Score 35; DB 13; Length 612; 

Best Local Similarity 100.0%; Pred. No. 79; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAED 7
       |||||||
Db 530 LVFFAED 536



RESULT 21
Q7ZZT1

ID Q7ZZT1 PRELIMINARY; PRT; 678 AA.

AC Q7ZZT1;

DT 01-JUN-2003 (TrEMBLrel. 24, Created)

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Amyloid protein a variant 2. 

GN APPA. 

OS Brachydanio rerio (Zebrafish) (Danio rerio).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;

OC Cyprinidae; Danio.

OX NCBI_TaxID=7955;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Groth C., Lardelli M.;

RT "Investigation of zebrafish appa expression during embryogenesis.";

RL Submitted (APR-2003) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AY271746; AAP22958.1; -. 

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR008154; A4_extra. 

DR InterPro; IPR001255; Beta-APP. 

DR Pfam; PF02177; A4_EXTRA; 1. 

DR Pfam; PF03494; Beta-APP; 1. 

DR PRINTS; PR00203; AMYLOIDA4. 

DR SMART; SM00006; A4_EXTRA; 1. 

DR PROSITE; PS00319; A4_EXTRA; 1. 

DR PROSITE; PS00320; A4_INTRA; 1. 

SQ SEQUENCE 678 AA; 76755 MW; 94163778444FD0BC CRC64;

Query Match 85.4%; Score 35; DB 13; Length 678;

Best Local Similarity 100.0%; Pred. No. 88;

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAED 7
       |||||||
Db 596 LVFFAED 602



RESULT 22 
Q9DGJ8 

ID Q9DGJ8 PRELIMINARY; PRT; 695 AA.

AC Q9DGJ8;

DT 01-MAR-2001 (TrEMBLrel. 16, Created)

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE Beta-amyloid precursor protein 695 isoform.

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;

OC Gallus.

OX NCBI_TaxID=9031;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Sarasa M., Rodolosse A., Sorribas V.;

RT "Cloning of full-length chicken beta-amyloid precursor protein

RT isoforms.";

RL Submitted (JUL-2000) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF289218; AAG00593.1; -. 

DR HSSP; P05067; 1BA4.

DR GO; GO:0016020; C:membrane; IEA.

DR InterPro; IPR008155; A4_APP.

DR InterPro; IPR008154; A4_extra. 

DR InterPro; IPR001255; Beta-APP. 

DR Pfam; PF02177; A4_EXTRA; 1. 

DR Pfam; PF03494; Beta-APP; 1. 

DR PRINTS; PR00203; AMYLOIDA4. 

DR SMART; SM00006; A4_EXTRA; 1. 

DR PROSITE; PS00319; A4_EXTRA; 1. 

DR PROSITE; PS00320; A4_INTRA; 1. 

SQ SEQUENCE 695 AA; 78565 MW; F201ED02AEC86D95 CRC64; 

Query Match 85.4%; Score 35; DB 13; Length 695; 

Best Local Similarity 100.0%; Pred. No. 90; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAED 7
       |||||||
Db 613 LVFFAED 619



RESULT 23 
Q90W28 

ID Q90W28 PRELIMINARY; PRT; 738 AA. 

AC Q90W28; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Amyloid precursor protein. 

GN APPA OR APP. 

OS Brachydanio rerio (Zebrafish) (Danio rerio).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;

OC Cyprinidae; Danio. 

OX NCBI_TaxID=7955; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Groth C., Lardelli M.;

RT "Expression analysis of zebrafish app.";

RL Submitted (JUN-2001) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF389401; AAK64495.1;

DR ZFIN; ZDB-GENE-000616-13; appa.

DR GO; GO:0016020; C:membrane; IEA.

DR GO; GO:0004867; F:serine protease inhibitor activity; IEA.

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR008154; A4_extra. 

DR InterPro; IPR001255; Beta-APP. 

DR InterPro; IPR002223; Kunitz_BPTI. 

DR Pfam; PF02177; A4_EXTRA; 1. 

DR Pfam; PF03494; Beta-APP; 1. 

DR Pfam; PF00014; Kunitz_BPTI; 1. 

DR PRINTS; PR00203; AMYLOIDA4. 

DR PRINTS; PR00759; BASICPTASE. 

DR ProDom; PD000222; Kunitz_BPTI; 1. 

DR SMART; SM00006; A4_EXTRA; 1. 

DR SMART; SM00131; KU; 1. 

DR PROSITE; PS00319; A4_EXTRA; 1. 

DR PROSITE; PS00320; A4_INTRA; 1. 

DR PROSITE; PS00280; BPTI_KUNITZ_1; 1.

DR PROSITE; PS50279; BPTI_KUNITZ_2; 1.

KW Protease inhibitor; Serine protease inhibitor.

SQ SEQUENCE 738 AA; 83577 MW; AF480F6D308FD298 CRC64;

Query Match 85.4%; Score 35; DB 13; Length 738; 

Best Local Similarity 100.0%; Pred. No. 96; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 LVFFAED 7
       |||||||
Db 656 LVFFAED 662



RESULT 24 
Q9DGJ7 

ID Q9DGJ7 PRELIMINARY; PRT; 751 AA. 

AC Q9DGJ7; 

DT 01-MAR-2001 (TrEMBLrel. 16, Created) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Beta-amyloid precursor protein 751 isoform. 

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;

OC Gallus.

OX NCBI_TaxID=9031;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Sarasa M., Rodolosse A., Sorribas V.;

RT "Cloning of full-length chicken beta-amyloid precursor protein

RT isoforms.";

RL Submitted (JUL-2000) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF289219; AAG00594.1; -.

DR HSSP; P05067; 1BA4.

DR GO; GO:0016020; C:membrane; IEA.

DR GO; GO:0004867; F:serine protease inhibitor activity; IEA.

DR InterPro; IPR008155; A4_APP. 

DR InterPro; IPR008154; A4_extra. 

DR InterPro; IPR001255; Beta-APP. 

DR InterPro; IPR002223; Kunitz_BPTI. 

DR Pfam; PF02177; A4_EXTRA; 1.

DR Pfam; PF03494; Beta-APP; 1.

DR Pfam; PF00014; Kunitz_BPTI; 1.

DR PRINTS; PR00203; AMYLOIDA4.

DR PRINTS; PR00759; BASICPTASE. 

DR ProDom; PD000222; Kunitz_BPTI; 1. 

DR SMART; SM00006; A4_EXTRA; 1. 

DR SMART; SM00131; KU; 1. 

DR PROSITE; PS00319; A4_EXTRA; 1. 

DR PROSITE; PS00320; A4_INTRA; 1. 

DR PROSITE; PS00280; BPTI_KUNITZ_1; 1.

DR PROSITE; PS50279; BPTI_KUNITZ_2; 1.

KW Protease inhibitor; Serine protease inhibitor. 

SQ SEQUENCE 751 AA; 84705 MW; E78E9413A8033D84 CRC64; 

Query Match 85.4%; Score 35; DB 13; Length 751; 

Best Local Similarity 100.0%; Pred. No. 98; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 LVFFAED 7
       |||||||
Db 669 LVFFAED 675



RESULT 25 
Q8T9D3 

ID Q8T9D3 PRELIMINARY; PRT; 1169 AA. 

AC Q8T9D3; 

DT 01-JUN-2002 (TrEMBLrel. 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE SD05642p. 

GN CG7161. 

OS Drosophila melanogaster (Fruit fly).

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;

OC Ephydroidea; Drosophilidae; Drosophila.

OX NCBI_TaxID=7227;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Stapleton M., Brokstein P., Hong L., Agbayani A., Carlson J.,

RA Champe M., Chavez C., Dorsett V., Farfan D., Frise E., George R.,

RA Gonzalez M., Guarin H., Li P., Liao G., Miranda A., Mungall C.J.,

RA Nunoo J., Pacleb J., Paragas V., Park S., Phouanenavong S., Wan K.,

RA Yu C., Lewis S.E., Rubin G.M., Celniker S.;

RL Submitted (DEC-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY069816; AAL39961.1; 

DR FlyBase; FBgn0035891; CG7161. 

DR InterPro; IPR001680; WD40. 

DR Pfam; PF00400; WD40; 4. 

DR ProDom; PD000018; WD40; 1. 

DR SMART; SM00320; WD40; 5.

DR PROSITE; PS50082; WD_REPEATS_2; 1.

DR PROSITE; PS50294; WD_REPEATS_REGION; 1.

KW Repeat; WD repeat. 

SQ SEQUENCE 1169 AA; 131625 MW; 0FF6AC96B32B6C07 CRC64; 



Query Match 85.4%; Score 35; DB 5; Length 1169;

Best Local Similarity 75.0%; Pred. No. 1.5e+02;

Matches 6; Conservative 2; Mismatches 0; Indels 0; Gaps 0;

Qy    1 LVFFAEDF 8
        |:|:||||
Db 1138 LLFYAEDF 1145



RESULT 26 
Q9VSJ6 

ID Q9VSJ6 PRELIMINARY; PRT; 1169 AA. 

AC Q9VSJ6; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created)

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE CG7161-PA. 

GN CG7161. 

OS Drosophila melanogaster (Fruit fly).

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae; Drosophila. 

OX NCBI_TaxID=7227; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20196006; PubMed=10731132;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X.,
RA Brandon R.C., Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D.,
RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Gabor G.L.,
RA Abril J.F., Agbayani A., An H.J., Andrews-Pfannkoch C., Baldwin D.,
RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M.,
RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S.,
RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P.,
RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I.,
RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P.,
RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M.,
RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P.,
RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W.,
RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K.,
RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J.,
RA Hostin D., Houston K.A., Howland T.J., Wei M.H., Ibegwam C.,
RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A.,
RA Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z.,
RA Lasko P., Lei Y., Levitsky A.A., Li J., Li Z., Liang Y., Lin X.,
RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D.,
RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A.,
RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L.,
RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M.,
RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G.,
RA Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.,
RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E.,
RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X.,
RA Wang Z.Y., Wassarman D.A., Weinstock G.M., Weissenbach J.,
RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., Ye J.,
RA Yeh R.F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L.,
RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O.,
RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [2]
RP SEQUENCE FROM N.A.
RA Misra S., Crosby M.A., Matthews B.B., Bayraktaroglu L., Campbell K.,
RA Hradecky P., Huang Y., Kaminker J.S., Prochnik S.E., Smith C.D.,
RA Tupy J.L., Bergman C.M., Berman B.P., Carlson J.W., Celniker S.E.,
RA Clamp M.E., Drysdale R.A., Emmert D., Frise E., de Grey A.D.N.J.,
RA Harris N.L., Kronmiller B., Marshall B., Millburn G.H., Richter J.,
RA Russo S., Searle S.M.J., Smith E., Shu S., Smutniak F.,
RA Whitfield E.J., Ashburner M., Gelbart W.M., Rubin G.M., Mungall C.J.,
RA Lewis S.E.;
RT "Annotation of Drosophila melanogaster genome.";
RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP SEQUENCE FROM N.A.
RA FlyBase;
RL Submitted (SEP-2002) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP SEQUENCE FROM N.A.
RA FlyBase;
RL Submitted (JAN-2003) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AE003555; AAF50423.3; -.
DR InterPro; IPR001680; WD40.
DR Pfam; PF00400; WD40; 3.
DR ProDom; PD000018; WD40; 1.
DR SMART; SM00320; WD40; 4.
DR PROSITE; PS50082; WD_REPEATS_2; 1.
DR PROSITE; PS50294; WD_REPEATS_REGION; 1.
SQ SEQUENCE 1169 AA; 131634 MW; 920176314C01C37D CRC64;

Query Match 85.4%; Score 35; DB 5; Length 1169;
Best Local Similarity 75.0%; Pred. No. 1.5e+02;
Matches 6; Conservative 2; Mismatches 0; Indels 0; Gaps 0;

Qy    1 LVFFAEDF 8
        |:|:||||
Db 1138 LLFYAEDF 1145



RESULT 27 
Q21915 

ID Q21915 PRELIMINARY; PRT; 222 AA. 

AC Q21915; 

DT 01-NOV-1996 (TrEMBLrel. 01, Created) 

DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE R10H10.1 protein. 

GN R10H10.1. 

OS Caenorhabditis elegans.

OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea;

OC Rhabditidae; Peloderinae; Caenorhabditis.

OX NCBI_TaxID=6239;

RN [1] 

RP SEQUENCE FROM N.A. 

RA Bardill S.C.; 

RL Submitted (APR-1996) to the EMBL/GenBank/DDBJ databases.

RN [2]

RP SEQUENCE FROM N.A.

RX MEDLINE=99069613; PubMed=9851916;

RA none; 

RT "Genome sequence of the nematode C.elegans: A platform for 

RT investigating biology."; 

RL Science 282:2012-2018(1998). 

DR EMBL; Z70686; CAA94609.1; 

DR PIR; T24151; T24151. 

DR WormPep; R10H10.1; CE06292. 

DR InterPro; IPR001075; NifU_C. 

DR Pfam; PF01106; NifU-like; 1. 

DR ProDom; PD002830; NifU_C; 1. 

SQ SEQUENCE 222 AA; 24757 MW; ED232B974B03E8D0 CRC64; 



Query Match 80.5%; Score 33; DB 5; Length 222;

Best Local Similarity 85.7%; Pred. No. 74;

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy  2 VFFAEDF 8
      ||| |||
Db 71 VFFGEDF 77



RESULT 28
Q9XSI7 

ID Q9XSI7 PRELIMINARY; PRT; 261 AA. 

AC Q9XSI7; 

DT 01-NOV-1999 (TrEMBLrel. 12, Created) 

DT 01-NOV-1999 (TrEMBLrel. 12, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE MT1-MMP protein (Fragment).

GN MMP-14.

OS Bos taurus (Bovine).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Skeletal muscle;

RX MEDLINE=21071388; PubMed=11204721;

RA Balcerzak D., Querenguesser L., Dixon W.T., Baracos V.E.; 

RT "Coordinate expression of matrix-degrading proteinases and their 

RT activators and inhibitors in bovine skeletal muscle."; 

RL J. Anim. Sci. 79:94-107(2001). 

DR EMBL; AF144758; AAD30298.1; -. 

DR HSSP; P39900; 1JIZ. 

DR MEROPS; M10.014; 

DR GO; GO:0005578; C:extracellular matrix; IEA.

DR GO; GO:0004222; F:metalloendopeptidase activity; IEA.

DR GO; GO:0008270; F:zinc ion binding; IEA.

DR GO; GO:0006508; P:proteolysis and peptidolysis; IEA.

DR InterPro; IPR000585; Hemopexin.

DR InterPro; IPR006026; Peptidase_M.

DR InterPro; IPR001818; Pept_M10A_M12B.

DR InterPro; IPR006025; Pept_M_Zn_BS.

DR Pfam; PF00045; hemopexin; 1.

DR Pfam; PF00413; Peptidase_M10; 1.

DR PRINTS; PR00138; MATRIXIN. 

DR SMART; SM00235; ZnMc; 1. 

DR PROSITE; PS00142; ZINC_PROTEASE; 1. 

KW Hydrolase; Metal-binding; Metalloprotease; Protease; Zinc 

FT NON_TER 1 1

FT NON_TER 261 261

SQ SEQUENCE 261 AA; 29854 MW; A0BF7CA8EDCF801F CRC64;



Query Match 80.5%; Score 33; DB 6; Length 261;

Best Local Similarity 75.0%; Pred. No. 87;

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy  1 LVFFAEDF 8
      |:|||| |
Db 87 LIFFAEGF 94



RESULT 29
Q87M09
ID Q87M09 PRELIMINARY; PRT; 448 AA.
AC Q87M09;
DT 01-JUN-2003 (TrEMBLrel. 24, Created)
DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Na+-coupled multidrug efflux pump.
GN VP2449.
OS Vibrio parahaemolyticus.
OC Bacteria; Proteobacteria; Gammaproteobacteria; Vibrionales;
OC Vibrionaceae; Vibrio.
OX NCBI_TaxID=670;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=RIMD 2210633 / Serotype O3:K6;
RX MEDLINE=22508454; PubMed=12620739;
RA Makino K., Oshima K., Kurokawa K., Yokoyama K., Uda T., Tagomori K.,
RA Iijima Y., Najima M., Nakano M., Yamashita A., Kubota Y., Kimura
RA Yasunaga T., Honda T., Shinagawa H., Hattori M., Iida T.;
RT "Genome sequence of Vibrio parahaemolyticus: a pathogenic mechanism
RT distinct from that of V. cholerae.";
RL Lancet 361:743-749(2003).
DR EMBL; AP005081; BAC60712.1; -.
DR GO; GO:0016020; C:membrane; IEA.
DR GO; GO:0015297; F:antiporter activity; IEA.
DR GO; GO:0015238; F:drug transporter activity; IEA.
DR GO; GO:0006855; P:multidrug transport; IEA.
DR InterPro; IPR002528; MatE.
DR Pfam; PF01554; MatE; 2.
KW Complete proteome.
SQ SEQUENCE 448 AA; 48993 MW; 6B301E64FDB6BCC6 CRC64;

Query Match 80.5%; Score 33; DB 16; Length 448;
Best Local Similarity 75.0%; Pred. No. 1.5e+02;
Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy   1 LVFFAEDF 8
       | |||:||
Db 105 LFFFADDF 112



RESULT 30 
Q50563 

ID Q50563 PRELIMINARY; PRT; 455 AA. 

AC Q50563; 

DT 01-NOV-1996 (TrEMBLrel. 01, Created) 

DT 01-NOV-1998 (TrEMBLrel. 08, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Phosphomannomutase. 

GN MTH1584. 

OS Methanobacterium thermoautotrophicum. 

OC Archaea; Euryarchaeota; Methanobacteria; Methanobacteriales;

OC Methanobacteriaceae; Methanothermobacter.

OX NCBI_TaxID=187420;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Delta H; 

RX MEDLINE=98037514; PubMed=9371463;

RA Smith D.R., Doucette-Stamm L.A., DeLoughery C., Lee H.-M., Dubois J.,

RA Aldredge T., Bashirzadeh R., Blakely D., Cook R., Gilbert K.,

RA Harrison D., Hoang L., Keagle P., Lumm W., Pothier B., Qiu D.,

RA Spadafora R., Vicare R., Wang Y., Wierzbowski J., Gibson R.,

RA Jiwani N., Caruso A., Bush D., Safer H., Patwell D., Prabhakar S.,

RA McDougall S., Shimer G., Goyal A., Pietrovski S., Church G.M.,

RA Daniels C.J., Mao J.-I., Rice P., Noelling J., Reeve J.N.;

RT "Complete genome sequence of Methanobacterium thermoautotrophicum

RT deltaH: functional analysis and comparative genomics.";

RL J. Bacteriol. 179:7135-7155(1997).

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Delta H;

RA Smith D.R.;

RL Submitted (AUG-1997) to the EMBL/GenBank/DDBJ databases.

RN [3]

RP SEQUENCE OF 1-174 FROM N.A.

RC STRAIN=Delta H;

RX MEDLINE=97158688; PubMed=9006048;

RA Noelling J., Reeve J.N.;

RT "Growth- and substrate-dependent transcription of the formate

RT dehydrogenase (fdhCAB) operon in Methanobacterium thermoformicicum Z-

RT 245.";

RL J. Bacteriol. 179:899-908(1997). 

DR EMBL; AE000918; AAB86057.1; -. 

DR EMBL; U51624; AAC44809.1; -. 

DR PIR; D69078; D69078. 

DR GO; GO:0016868; F:intramolecular transferase activity, phosph...; IEA.

DR GO; GO:0005975; P:carbohydrate metabolism; IEA.

DR InterPro; IPR005841; PG/PMM_mutase.

DR InterPro; IPR005844; PG_PMM_ABAI.

DR InterPro; IPR005845; PG_PMM_ABAII.

DR InterPro; IPR005846; PG_PMM_ABAIII.

DR InterPro; IPR005843; PG_PMM_C. 

DR Pfam; PF00408; PGM_PMM; 1. 

DR Pfam; PF02878; PGM_PMM_I; 1. 

DR Pfam; PF02879; PGM_PMM_II; 1. 

DR Pfam; PF02880; PGM_PMM_III; 1. 

DR PRINTS; PR00509; PGMPMM.

DR PROSITE; PS00710; PGM_PMM; 1. 

KW Complete proteome. 

SQ SEQUENCE 455 AA; 49579 MW; E98E44E9F8A5C40B CRC64; 

Query Match 80.5%; Score 33; DB 17; Length 455; 

Best Local Similarity 71.4%; Pred. No. 1.5e+02; 

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 

Qy   2 VFFAEDF 8
       :||:|||
Db 124 IFFSEDF 130



RESULT 31 
O62511

ID O62511 PRELIMINARY; PRT; 502 AA.

AC O62511;

DT 01-AUG-1998 (TrEMBLrel. 07, Created) 

DT 01-AUG-1998 (TrEMBLrel. 07, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE ZK550.2 protein. 

GN ZK550.2. 

OS Caenorhabditis elegans.

OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea; 

OC Rhabditidae; Peloderinae; Caenorhabditis. 

OX NCBI_TaxID=6239; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Basham V.M.;

RL Submitted (NOV-1996) to the EMBL/GenBank/DDBJ databases.

RN [2]

RP SEQUENCE FROM N.A.

RX MEDLINE=99069613; PubMed=9851916;

RA none; 

RT "Genome sequence of the nematode C. elegans: A platform for 

RT investigating biology."; 

RL Science 282:2012-2018(1998). 

DR EMBL; Z82287; CAB05312.1; 

DR PIR; T27908; T27908. 

DR WormPep; ZK550.2; CE16733. 

DR InterPro; IPR007114; MFS.

DR PROSITE; PS50850; MFS; 1. 

SQ SEQUENCE 502 AA; 55666 MW; B1B98A5B34254056 CRC64; 

Query Match 80.5%; Score 33; DB 5; Length 502; 

Best Local Similarity 75.0%; Pred. No. 1.7e+02; 

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy   1 LVFFAEDF 8
       |||| ||:
Db 227 LVFFQEDY 234



RESULT 32 
Q8THF4 

ID Q8THF4 PRELIMINARY; PRT; 651 AA. 

AC Q8THF4; 

DT 01-JUN-2002 (TrEMBLrel. 21, Created)

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update)

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)

DE Hydantoinase.

GN HYUA OR MA4563.

OS Methanosarcina acetivorans.

OC Archaea; Euryarchaeota; Euryarchaeota orders incertae sedis;

OC Methanosarcinales; Methanosarcinaceae; Methanosarcina.

OX NCBI_TaxID=2214; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C2A / ATCC 35395 / DSM 2834; 

RX MEDLINE=21929760; PubMed=11932238;

RA Galagan J.E., Nusbaum C., Roy A., Endrizzi M.G., Macdonald P.,

RA FitzHugh W., Calvo S., Engels R., Smirnov S., Atnoor D., Brown A.,

RA Allen N., Naylor J., Stange-Thomann N., DeArellano K., Johnson R.,

RA Linton L., McEwan P., McKernan K., Talamas J., Tirrell A., Ye W.,

RA Zimmer A., Barber R.D., Cann I., Graham D.E., Grahame D.A., Guss A.M.,

RA Hedderich R., Ingram-Smith C., Kuettner H.C., Krzycki J.A.,

RA Leigh J.A., Li W., Liu J., Mukhopadhyay B., Reeve J.N., Smith K.,

RA Springer T.A., Umayam L.A., White O., White R.H., de Macario E.C.,

RA Ferry J.G., Jarrell K.F., Jing H., Macario A.J.L., Paulsen I.,

RA Pritchett M., Sowers K.R., Swanson R.V., Zinder S.H., Lander E.,

RA Metcalf W.W., Birren B.;

RT "The genome of Methanosarcina acetivorans reveals extensive metabolic

RT and physiological diversity."; 

RL Genome Res. 12:532-542(2002). 

DR EMBL; AE011177; AAM07902.1; 

DR GO; GO:0016787; F:hydrolase activity; IEA.

DR InterPro; IPR002821; Hydantoinase_A. 

DR InterPro; IPR008040; Hydant_A_N. 

DR Pfam; PF01968; Hydantoinase_A; 1. 

DR Pfam; PF05378; Hydant_A_N; 1. 

KW Complete proteome. 

SQ SEQUENCE 651 AA; 71402 MW; 11214F3C3737F79B CRC64;

Query Match 80.5%; Score 33; DB 17; Length 651;

Best Local Similarity 75.0%; Pred. No. 2.2e+02;

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 0;

Qy   1 LVFFAEDF 8
       | |||:||
Db 486 LTFFAKDF 493



RESULT 33 
Q968T7 

ID Q968T7 PRELIMINARY; PRT; 3610 AA. 

AC Q968T7; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 



DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update)

DE Chloroquine resistance marker protein. 

OS Plasmodium falciparum. 

OC Eukaryota; Alveolata; Apicomplexa; Haemosporida; Plasmodium. 

OX NCBI_TaxID=5833; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=K1; 

RA Li G.D., Bray P.G., Ward S.A.; 

RT "Possible chloroquine-resistance gene in Plasmodium falciparum."; 

RL Submitted (JUN-2000) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF282241; AAK49448.1; -. 

SQ SEQUENCE 3610 AA; 432028 MW; F6CAE8DD71FBF906 CRC64; 



Query Match 80.5%; Score 33; DB 5; Length 3610; 

Best Local Similarity 71.4%; Pred. No. 1.3e+03; 

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 

Qy   2 VFFAEDF 8
       :||:|||
Db 554 IFFSEDF 560



RESULT 34 
Q968T6 

ID Q968T6 PRELIMINARY; PRT; 3620 AA.

AC Q968T6;

DT 01-DEC-2001 (TrEMBLrel. 19, Created)

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update)

DE Chloroquine resistance marker protein.

OS Plasmodium falciparum (isolate Dd2).

OC Eukaryota; Alveolata; Apicomplexa; Haemosporida; Plasmodium. 

OX NCBI_TaxID=57267; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Dd2;

RA Li G.D., Bray P.G., Ward S.A.;

RT "Possible chloroquine-resistance gene in Plasmodium falciparum.";

RL Submitted (JUN-2000) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF282242; AAK49449.1; 

SQ SEQUENCE 3620 AA; 433151 MW; C6519F1F60F72349 CRC64; 

Query Match 80.5%; Score 33; DB 5; Length 3620; 

Best Local Similarity 71.4%; Pred. No. 1.3e+03; 

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 



Qy   2 VFFAEDF 8
       :||:|||
Db 553 IFFSEDF 559



RESULT 35 
Q968Y0 

ID Q968Y0 PRELIMINARY; PRT; 3628 AA.

AC Q968Y0;

DT 01-DEC-2001 (TrEMBLrel. 19, Created)

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Chloroquine resistance marker protein. 

OS Plasmodium falciparum. 

OC Eukaryota; Alveolata; Apicomplexa; Haemosporida; Plasmodium.

OX NCBI_TaxID=5833;

RN [1]

RP SEQUENCE FROM N.A.

RA Li G.D., Bray P.G., Ward S.A.;

RT "Plasmodium falciparum chloroquine resistance marker gene.";

RL Submitted (JUL-1999) to the EMBL/GenBank/DDBJ databases.

DR EMBL; AF169404; AAK48418.1; -.

SQ SEQUENCE 3628 AA; 433967 MW; BCDBF5CDA3BA95FB CRC64;



Query Match 80.5%; Score 33; DB 5; Length 3628;

Best Local Similarity 71.4%; Pred. No. 1.3e+03;

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0;

Qy   2 VFFAEDF 8
       :||:|||
Db 554 IFFSEDF 560



RESULT 36
Q8IKY8
ID Q8IKY8 PRELIMINARY; PRT; 3704 AA.
AC Q8IKY8;
DT 01-MAR-2003 (TrEMBLrel. 23, Created)
DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update)
DE Chloroquine resistance marker protein.
GN PF14_0463.
OS Plasmodium falciparum (isolate 3D7).
OC Eukaryota; Alveolata; Apicomplexa; Haemosporida; Plasmodium.
OX NCBI_TaxID=36329;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=3D7;
RX MEDLINE=22255705; PubMed=12368864;
RA Gardner M.J., Hall N., Fung E., White O., Berriman M., Hyman R.W.,
RA Carlton J.M., Pain A., Nelson K.E., Bowman S., Paulsen I.T., James K.,
RA Eisen J.A., Rutherford K., Salzberg S.L., Craig A., Kyes S.,
RA Chan M.-S., Nene V., Shallom S.J., Suh B., Peterson J., Angiuoli S.,
RA Pertea M., Allen J., Selengut J., Haft D., Mather M.W., Vaidya A.B.,
RA Martin D.M.A., Fairlamb A.H., Fraunholz M.J., Roos D.S., Ralph S.A.,
RA McFadden G.I., Cummings L.M., Subramanian G.M., Mungall C.,
RA Venter J.C., Carucci D.J., Hoffman S.L., Newbold C., Davis R.W.,
RA Fraser C.M., Barrell B.;
RT "Genome sequence of the human malaria parasite Plasmodium
RT falciparum.";
RL Nature 419:498-511(2002).
DR EMBL; AE014823; AAN37076.1;
SQ SEQUENCE 3704 AA; 443067 MW; 27EF423D961E0B77 CRC64;

Query Match 80.5%; Score 33; DB 5; Length 3704;
Best Local Similarity 71.4%; Pred. No. 1.3e+03;
Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0;

Qy   2 VFFAEDF 8
       :||:|||
Db 554 IFFSEDF 560



RESULT 37
Q9BHZ8
ID Q9BHZ8 PRELIMINARY; PRT; 60 AA.
AC Q9BHZ8;
DT 01-JUN-2001 (TrEMBLrel. 17, Created)
DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Thioredoxin peroxidase, putative (Fragment).
GN THP1.
OS Globodera rostochiensis (Golden nematode).
OC Eukaryota; Metazoa; Nematoda; Chromadorea; Tylenchida; Tylenchina;
OC Tylenchoidea; Heteroderidae; Heteroderinae; Globodera.
OX NCBI_TaxID=31243;
RN [1]
RP SEQUENCE FROM N.A.
RA Qin L., Schots A., Smant G., Bakker J., Helder J.;
RL Submitted (MAR-1999) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AJ133420; CAC34452.1;
DR GO; GO:0004601; F:peroxidase activity; IEA.
DR InterPro; IPR000866; AhpC-TSA.
DR Pfam; PF00578; AhpC-TSA; 1.
KW Peroxidase.
FT NON_TER 1 1
FT NON_TER 60 60
SQ SEQUENCE 60 AA; 6785 MW; 2BD9A80FFC7895B9 CRC64;

Query Match 78.0%; Score 32; DB 5; Length 60;
Best Local Similarity 62.5%; Pred. No. 31;
Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy  1 LVFFAEDF 8
      |:|: |||
Db 36 LIFYPEDF 43



RESULT 38 
Q90160 

ID Q90160 PRELIMINARY; PRT; 75 AA. 

AC Q90160; 

DT 01-NOV-1996 (TrEMBLrel. 01, Created) 

DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-NOV-1998 (TrEMBLrel. 08, Last annotation update) 

DE Chitinase. 

OS Bombyx mori nuclear polyhedrosis virus (BmNPV).

OC Viruses; dsDNA viruses, no RNA stage; Baculoviridae;

OC Nucleopolyhedrovirus.

OX NCBI_TaxID=10458;

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=T3;

RX MEDLINE=94365965; PubMed=8083997;

RA Ohkawa T., Majima K., Maeda S.;

RT "A cysteine protease encoded by the baculovirus Bombyx mori nuclear 

RT polyhedrosis virus."; 

RL J. Virol. 68:6619-6625(1994). 

DR EMBL; U12688; AAB49543.1;

SQ SEQUENCE 75 AA; 8403 MW; A082D0827846701E CRC64; 

Query Match 78.0%; Score 32; DB 12; Length 75; 

Best Local Similarity 62.5%; Pred. No. 40; 

Matches 5; Conservative 2; Mismatches 1; Indels 0; Gaps 0;

Qy  1 LVFFAEDF 8
      |:|:| ||
Db 51 LIFYASDF 58



RESULT 39 
Q931V3 

ID Q931V3 PRELIMINARY; PRT; 184 AA. 

AC Q931V3; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created)

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE GTP pyrophosphokinase.

GN YJBM OR SAV1001.

OS Staphylococcus aureus (strain Mu50 / ATCC 700699).

OC Bacteria; Firmicutes; Bacillales; Staphylococcus. 

OX NCBI_TaxID=158878; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21311952; PubMed=11418146;

RA Kuroda M., Ohta T., Uchiyama I., Baba T., Yuzawa H., Kobayashi I.,

RA Cui L., Oguchi A., Aoki K.-I., Nagai Y., Lian J.-Q., Ito T.,

RA Kanamori M., Matsumaru H., Maruyama A., Murakami H., Hosoyama A.,

RA Mizutani-Ui Y., Takahashi N.K., Sawano T., Inoue R.-I., Kaito C.,

RA Sekimizu K., Hirakawa H., Kuhara S., Goto S., Yabuzaki J.,

RA Kanehisa M., Yamashita A., Oshima K., Furuya K., Yoshino C., Shiba T.,

RA Hattori M., Ogasawara N., Hayashi H., Hiramatsu K.;

RT "Whole genome sequencing of meticillin-resistant Staphylococcus 

RT aureus.";

RL Lancet 357:1225-1240(2001). 

DR EMBL; AP003361; BAB57163.1; -. 

DR GO; GO:0016301; F:kinase activity; IEA.

KW Kinase; Complete proteome. 

SQ SEQUENCE 184 AA; 21755 MW; 347AD094AFCCCD03 CRC64;

Query Match 78.0%; Score 32; DB 16; Length 184; 

Best Local Similarity 85.7%; Pred. No. 99; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy  1 LVFFAED 7
      ||||:||
Db 81 LVFFSED 87



RESULT 40 



